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Preface 


Number theory, an abstruse specialty 
concerned with the properties of 
whole numbers. 

TIME, April 4, 1983. 


The texts to be examined in this volume range from an 
Old Babylonian tablet, datable to the time of Hammurapi 
or thereabouts, to Legendre’s Essai sur la Théorie des Nombres 
of 1798. In the main, the volume stops short of Gauss’s 
Disquisitiones of 1801, even though it includes an episode 
from Legendre’s later career and does not avoid relevant 
references to the discoveries of Gauss and his successors. 

Until rather recently, number theory, or arithmetic as 
some prefer to call it, has been conspicuous for the quality 
rather than for the number of its devotees; at the same time 
it is perhaps unique in the enthusiasm it has inspired, an 
enthusiasm eloquently expressed in many utterances of such 
men as Euler, Gauss, Eisenstein, Hilbert. Thus, while this 
book covers some thirty-six centuries of arithmetical work, 
its bulk consists in nothing more than a detailed study and 
exposition of the achievements of four mathematicians: 
Fermat, Euler, Lagrange, Legendre. These are the founders 
of modern number theory. The greatness of Gauss lies in 
his having brought to completion what his predecessors 
had initiated, no less than in his inaugurating a new era in 
the history of the subject. 

Our main task will be to take the reader, so far as prac- 
ticable, into the workshop of our authors, watch them at 
work, share their successes and perceive their failures. 
Luckily this does not require any deep delving into archives 
and manuscript collections. As the bibliography will witness, 


x Preface 


almost all the mathematicians to be mentioned in this volume 
have had their complete works, and their surviving cor- 
respondence, excellently edited and published. I have also 
had the good fortune of being able to consult the original 
editions of Viéte, Bachet, Fermat, Wallis, Legendre in the 
Rosenwald Rare Book Collection at the Institute for Ad- 
vanced Study. 

The method to be followed here is historical throughout; 
no specific knowledge is expected of the reader, and it is 
the author’s fond hope that some readers at least will find 
it possible to get their initiation into number theory by fol- 
lowing the itinerary retraced in this volume. Some appro- 
priate background may be obtained from the author’s Number 
theory for beginners (whose content, incidentally, is almost 
entirely taken from Euler), or also (at a more sophisticated 
level) from Chapter I of J.-P. Serre’s Cours d’Arith- 
métique. More detailed background information, along with 
some supplementary material (not of strictly historical 
character) will be supplied in the Appendices following 
Chapters II, II, IV. 

My warm thanks, and the reader’s, are due to my friend 
S. S. Chern for the beautiful calligraphy adorning the fron- 
tispiece. Thanks are also due to the University Museum, 
University of Pennsylvania for permission to reproduce the 
photograph of the Tang horse in their collection; to 
O. Neugebauer for the photograph of the tablet PLIMPTON 
322; and to the Archives de Il’ Académie des Sciences, Paris, 
for the photograph of Lagrange’s portrait engraved by Del- 
pech. Finally I wish to express my gratitude to the Birkhauser 
Verlag, Boston, and more specifically to Klaus Peters and 
his team of collaborators for the keen interest they have 
taken in this project, their helpful hand on more than one 
occasion, and the high technical skill they have brought to 
bear upon all matters pertaining to the publication of this 
volume. 


Princeton, 15 April 1983 
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References and dates 


References to the above works are mostly given by in- 
dicating the volume (when necessary) and the page. Example: 
Fe.11.194 means Fermat’s @uvres (in the standard Tannery- 
Henry edition), vol.II, p.194. 

Exceptionally references to Euclid and to Diophantus are 
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Example: Eucl. VII.2 means Euclid (in the standard Heiberg 
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Diophantus (the Tannery edition), Book V, problem 11. In 
the case of Diophantus, it has occasionally been found nec- 
essary to include the numbering in Bachet’s edition of 1621 
(or, what amounts to the same, in S. de Fermat’s edition of 
1670; cf. Bibliography); this is indicated thus: Dioph.V.11 
= Dioph.V.14,, meaning that prob. 11, Book V, of Dio- 
phantus (Tannery’s edition) is numbered 14 in Bachet’s 
edition. 
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of Enestrém, vol.2 of Series I, pages 531 to 555, written 
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the date of presentation, either to the Petersburg or to the 
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that volume. 

Additional references will be found in the bibliography 
on pages 361—364. 


Notations 


Traditional algebraic notation has been used throughout; 
as it cannot be said to have been well established until late 
in the seventeenth century, it is anachronistic in the de- 
scription of the work of earlier writers (including Fermat, 
who used Viéte’s notations). From Euler on, our notation 
will usually conform to that of the original authors, except 
that the congruence notation, which only goes back to Gauss, 
has been adopted throughout as a convenient abbreviation. 
We recall that the “congruence” 


a = b(mod.m), 


with the “modulus” m, means that a — bis a multiple of m. 
Thus an integer a is a “quadratic residue” modulo m if there 
is b such that a = 6? (mod.m), and a “quadratic non-residue” 
otherwise; it is an n-th power residue if there is 6 such that 
a = 5b” (mod.m). 

Also for brevity’s sake, the matrix notation has been used 
occasionally (in Chapter III, § XIII, and Appendix III of 
Chap.IV); Z is used to designate the “ring” consisting of all 
integers (positive, negative and 0); Q, R, C, to designate 
the “fields” consisting respectively of all rational numbers, 
of all real numbers, and of all “imaginary” (or “complex”) 
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numbers a + b\/ —1 (where a, b are real); when p is any 
prime, F, is the “field” of p elements, consisting of the con- 
gruence classes modulo p. If 6 is a “quadratic irrationality”, 
actually VN, where Nisa non-square integer, or the “cubic 


2 
of all elements a + 56 where a and 6 are integers, and Q(@) 
is the “field” consisting of all elements r + s@, where r and 
§ are rational numbers. 

For reasons of typographical convenience, the Legendre 
symbol 


root of unity” , then Z[6] is the “ring” consisting 


(invariably so written by Legendre, Dirichlet and most classical 
authors) has frequently been printed as (n/p). It is the symbol, 
defined whenever p is a prime and n an integer prime to 
p, which has the value +1 when n is a quadratic residue 
modulo p, and — | otherwise. 


Chapter One 
Protohistory 


§I. 


According to JACOBI, the theory of elliptic functions 
was born between the twenty-third of December 1751 and 
the twenty-seventh of January 1752. On the former date, 
the Berlin Academy of Sciences handed over to EULER 
the two volumes of Marchese FAGNANO'’s Produzioni Ma- 
tematiche, published in Pesaro in 1750 and just received from 
the author; Euler was requested to examine the book and 
draft a suitable letter of thanks. On the latter date, Euler, 
referring explicitly to Fagnano’s work on the lemniscate, 
read to the Academy the first of a series of papers, eventually 
proving in full generality the addition and multiplication 
theorems for elliptic integrals. 

One might similarly try to record the date of birth of the 
modern theory of numbers; like the god Bacchus, however, 
it seems to have been twice-born. Its first birth must have 
occurred at some point between 1621 and 1636, probably 
closer to the latter date. In 1621, the Greek text of DIO- 
PHANTUS was published by BACHET, along with a useful 
Latin translation and an extensive commentary. It is not 
known when FERMAT acquired a copy of this book (the 
same one, no doubt, into whose margins he was later to jot 
down some of his best discoveries), nor when he began to 
read it; but, by 1636, as we learn from his correspondence, 
he had not only studied it carefully but was already devel- 
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oping ideas of his own about a variety of topics touched 
upon in that volume. 

From then on, “numbers”, i.e., number theory, never 
ceased to be among Fermat’s major interests; but his valiant 
efforts to gain friends for his favorite subject were not, on 
the whole, crowned with success. “There is no lack of better 
topics for us to spend our time on” was young HUYGENS’ 
comment to WALLIS (Huy.IL.211 = Fe.1V.121). At one 
time Fermat cherished the thought of devoting a whole 
book to number theory (Fe.I.305). On another occasion he 
tried to persuade PASCAL to collaborate with him in writing 
such a book (Fe.11.299—300); of course he realized that Pas- 
cal’s gifts of exposition were far superior to his own. To 
our great loss, Pascal showed no interest and politely declined 
the suggestion (Fe.II.314); his views may well have been 
similar to those of Huygens. After Fermat’s death in 1665, 
there was a great demand for a publication of his writings, 
hardly any of which had ever appeared in print. In 1670, 
his son Samuel published a reprint of Bachet’s Diophantus, 
along with Fermat’s marginal notes and an essay by the 
Jesuit father Jacques de BILLY on Fermat’s methods for 
solving certain types of so-called diophantine equations. 
This was followed in 1679 by the publication, also by Samuel, 
of a volume of his father’s Varia Opera, including a few 
letters of arithmetical content. But it took half a century 
for this to have any effect, and in the meanwhile number 
theory seemed to have died off. 

As to its rebirth, we can pinpoint it quite accurately. In 
1729, young EULER was in St. Petersburg, an adjunct of 
the newly founded Academy of Sciences; his friend and 
patron GOLDBACH was in Moscow. Their correspondence 
has been carefully preserved and was published by Euler’s 
great-grandson in 1843. Goldbach, in his own amateurish 
way, was keenly interested in mathematics and particularly 
in “numbers”; it is in a letter to Euler that he stated the 
conjecture to which his name has remained attached. On 
the first of December 1729, Goldbach asked Euler for his 
views about Fermat’s statement that all integers 2” + I are 
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primes (cf. infra, Chap.III, §IV). In his answer, Euler ex- 
presses some doubts; but nothing new occurs until the fourth 
of June, when Euler reports that he has “just been reading 
Fermat” and that he has been greatly impressed by Fermat’s 
assertion that every integer is a sum of four squares (and 
also of 3 triangular numbers, of 5 pentagonal numbers, 
etc.). After that day, Euler never lost sight of this topic and 
of number theory in general; eventually LAGRANGE fol- 
lowed suit, then LEGENDRE, then GAUSS with whom 
number theory reached full maturity. Although never a 
popular subject, it has been doing quite well ever since. 

Thus an account of number theory since Fermat can do 
full justice to the inner coherence of the topic as well as to 
the continuity of its development. In contrast with this, the 
mere fact that Fermat initially had to draw his main inspi- 
ration from a Greek author of the third century, only lately 
come to light, points to the entirely different character of 
much of the earlier mathematical work as well as to the 
frequent disappearance and re-appearance of essential 
sources of knowledge in former times. About ancient math- 
ematics (whether Greek or Mesopotamian) and medieval 
mathematics (Western or Oriental), the would-be historian 
must of necessity confine himself to the description of a 
comparatively small number of islands accidentally emerging 
from an ocean of ignorance, and to tenuous conjectural 
reconstructions of the submerged continents which at one 
time must have bridged the gaps between them. Lacking 
the continuity which seems essential to history, his work 
might better be described by some other name. It is not 
prehistory, since it depends on written sources; protohistory 
seems more appropriate. 

Of course new texts may turn up; they do from time to 
time. Our knowledge of ARCHIMEDES was greatly en- 
riched in 1906 by the discovery of a palimpsest in Istanbul. 
What survives of DIOPHANTUS consists of six chapters 
or “books”, while thirteen books are announced in the in- 
troduction; some new material pertaining to Diophantus, 
re-worked or perhaps translated from the original text, has 
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been found recently in an Arabic manuscript; more may 
yet be forthcoming. Important cuneiform texts may still be 
buried underground in Mesopotamia, or even more probably 
(according to Neugebauer) in the dusty basements of our 
museums. Arabic and Latin medieval manuscripts by the 
score await identification, even in well-explored libraries. 
Still, what hope is there of our ever getting, say, a full picture 
of early Greek geometry? In the third century B.C., EU- 
DEMOS (not himself a mathematician) wrote in four “books” 
a history of geometry, some fragments of which have been 
preserved. But what may have been the contents of his 
history of arithmetic, comprising at least two books, all but 
entirely lost? Even if part of it concerned topics which we 
might regard as algebra, some of it must have been number 
theory. To try to reconstruct such developments from hints 
and allusions found in the work of philosophers, even of 
those who professed a high regard for mathematics, seems 
as futile as would be an attempt to reconstruct Newton’s 
Principia out of the writings of Locke and Voltaire, or his 
differential calculus from the criticism of Bishop Berkeley. 

It will now be our purpose to describe briefly, without 
any claim to completeness, a few highlights from the scanty 
remains of number-theorists prior to the seventeenth cen- 
tury. I have tried to exclude what belongs more properly 
to algebra (for instance, the solution of linear equations and 
linear systems), but the distinction between the two topics 
is often far from clearcut. 


SII. 


Of all the topics occurring in ancient mathematics, perhaps 
the one which belongs most clearly to number theory con- 
cerns the basic multiplicative properties of positive integers; 
they receive a fairly full treatment in EUCLID’s books VII, 
VIII and IX. It is generally agreed upon that much, if not 
all, of the content of those books is of earlier origin, but 
little can be said about the history behind them. Some facts 
concerning divisibility must have been known in Mesopo- 
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tamia; any table of reciprocals in the sexagesimal system 
points clearly to the essential distinction between those in- 
tegers which contain no other prime than 2, 3 and 5, and 
all others. That the strictly additive treatment of fractions 
in Egyptian mathematics was eventually supplemented in 
Greece by a multiplicative treatment of the ratios between 
integers indicates a basic change of attitude which, according 
to Paul Tannery’s very plausible hypothesis, may well have 
had its source in musical theory. This may in turn have had 
some connection with early proofs for the irrationality of 
the simplest square roots, such as V2 and V5, but we do 
not know what those proofs were; if Aristotle, in the course 
of a discussion on the logical structure of proofs (Analytica 
Priora 1.23), hints at one proof for V2, this hardly gives us 
a right to credit it to some hypothetical “Pythagoreans”. 
The concept of a prime may have occurred quite early, 
together with that of a divisor and that of a common multiple 
for given integers; all we can say is that Plato, in his late 
work The Laws (737e—738a), mentions some properties of 
the number 5040, stressing the fact that it is a common 
multiple of numbers up to 10 (but so is 2520) and that it 
has 59 divisors, not counting 5040 itself; this points to some 
advanced knowledge about the factorization of integers 
among the mathematicians in Plato’s Academy, but how 
much cannot be ascertained. Was there originally a relation 
between the so-called “Euclidean algorithm”, as described 
in Eucl.VI1.1—2, for finding the g.c.d. of two integers, and 
the theory of the same process (Eucl.X.2) as it applies to 
possibly incommensurable magnitudes? Has it not often 
happened that a mathematical process has been discovered 
twice, in different contexts, long before the substantial 
identity between the two discoveries came to be perceived? 
Some of the major advances in mathematics have occurred 
just in this manner. 

Even in Euclid, we fail to find a general statement about 
the uniqueness of the factorization of an integer into primes; 
surely he may have been aware of it, but all he has is a 
statement (Eucl.[X.14) about the l.c.m. of any number of 
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given primes. Finally, the proof for the existence of infinitely 
many primes (Eucl.1X.20) represents undoubtedly a major 
advance, but there is no compelling reason either for at- 
tributing it to Euclid or for dating it back to earlier times. 
What matters for our purposes is that the very broad dif- 
fusion of Euclid in later centuries, while driving out all 
earlier texts, made this body of knowledge widely available 
to mathematicians from then on. 


SILI. 


Magical or mystical properties of numbers occur in many 
cultures. Somehow, either in Greece or earlier, the idea of 
perfection attached itself to those integers which are equal 
to the sum of their divisors. The last theorem in the arith- 
metical books of Euclid (Eucl.1X.36), and possibly, in their 
author’s view, the apex of his number-theoretical work, as- 
serts that 2"(2”*'—1) is perfect if the second factor is a 
prime. The topic, along with some of its extensions (such 
as the pairs of “amicable” numbers), occurs sporadically in 
later work, perhaps because of the special appeal of the 
words designating those concepts. It is of little theoretical 
importance and would not have to be mentioned here, were 
it not for the fact that it did attract a good deal of attention 
among some of Fermat’s contemporaries, such as Mersenne 
and Frenicle, including even Fermat himself, and played 
some part in his early investigations (cf. infra, Chap.II, §1V). 


SIV. 


Indeterminate equations of the first degree, to be solved 
in integers, must have occurred quite early in various cul- 
tures, either as puzzles (as exemplified by various epigrams 
in the Greek Anthology; cf. Dioph., vol. I, pp. 43-72), or, 
more interestingly for the mathematician, as calendar 
problems. A typical problem of this kind may be formulated 
as a double congruence 


x = p (mod a), x = q (mod 5), 
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or as the linear congruence ax = m (mod 6), or as an equation 
ax — by = min integers. The general method of solution 
for this is essentially identical with the “Euclidean algorithm” 
for finding the g.c.d. of @ and 6b (Euc/.VII.2) or also (in 
modern terms) with the calculation of the continued fraction 
for a/b; the relation between the two problems is indeed so 
close that whoever knows how to solve the one can hardly 
fail to solve the other if the need for it arises. Nevertheless, 
if we leave China aside, the first explicit description of the 
general solution occurs in the mathematical portion of the 
Sanskrit astronomical work Aryabhatiya, of the fifth—sixth 
century A.D. (cf. e.g. Datta and Singh, History of Hindu 
Mathematics, Lahore 1938, vol. II, pp. 93-99). In later Sanskrit 
texts this became known as the kuttaka (= “pulverizer”) 
method; a fitting name, recalling to our mind Fermat’s “in- 
finite descent”. As Indian astronomy of that period is largely 
based on Greek sources, one is tempted to ascribe the same 
origin to the kuttaka, but of course proofs are lacking. 

Then, in 1621, Bachet, blissfully unaware (of course) of 
his Indian predecessors, but also of the connection with the 
seventh book of Euclid, claimed the same method em- 
phatically as his own in his comments on Dioph.1V.41, (=IV, 
lemma to 36), announcing that it was to be published in a 
book of arithmetical “elements”; as this never appeared, he 
inserted it in the second edition of his Problémes plaisants et 
délectables (Lyon 1624), which is where Fermat and Wallis 
found it; both of them, surely, knew their Euclid too well 
not to recognize the Euclidean algorithm there. 


§V. 


There is no need to try to enumerate here the many 
sources, both Mesopotamian and Greek, for the summation 
of arithmetic progressions (such as =i'n), geometric pro- 
gressions (such as 292") and arithmetic progressions of 
higher order (such as =n"); so far as Greece is concerned, 
this cannot be separated from the consideration of “fig- 
urative” numbers. The most primitive kind of tabulation 
leads directly to not a few formulas of this kind (e.g. to 
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n® = the sum of the first n odd numbers) which can then 
also be verified by suitable diagrams. Such results must have 
become fairly universally known at a comparatively early 
date; invoking the name of Pythagoras adds little to our 
understanding of the matter. 

The same may be said about “pythagorean” triangles, a 
phrase by which we will merely understand those triples of 
integers (a, b, c) which satisfy the condition 


(1) e+P=ac 


and which therefore, if the geometrical theorem known to 
us under the name of Pythagoras is assumed, measure the 
sides of a right-angled triangle. Of course, if (a, b, c) is such, 
so is (0, a, c); allowing for this permutation, one finds that 
the general solution of (1) is given by 


(2) a=dx 2bq, b=dx (p’-@), c=ad x (p?+q?), 


where p, g are mutually prime, and p — q is odd and >0; 
dis then the g.c.d. of a, 6, c. Taking d = 1, one gets (up to 
the same permutation as before) the general solution of (1) 
in mutually prime integers. The simplest such triple is of 
course (3, 4, 5), which seems to have belonged to a kind of 
mathematical folklore from very early times. 

A table of fifteen pythagorean triples has been preserved 
in the Old Babylonian tablet PLIMPTON 322, published 
by O. Neugebauer and A. Sachs (Mathematical Cuneiform 
Texts, New Haven 1945, pp. 38-41) and dated by them to 
between 1900 and 1600 B.C. Perhaps it was calculated as 
an aid to trigonometry; it must have been based on some 
formula, either (2) itself or more simply the relation 


a® = (c+b\(c—b); 


the values for a are all “regular” numbers (containing no 
prime factor other than 2, 3, 5), while the values of b?/a?, 
listed in decreasing order, range at fairly regular intervals 
from nearly 1 down to almost 4. 

It is, alas, all too easy to ascribe similar knowledge to “the 
Pythagoreans”; taking Proclus’s word for it merely begs the 
question. It is then no less easy to assert, either that they 
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re-discovered it, or (more plausibly perhaps) that it came 
to them through unknown channels from Mesopotamia. 
Certain it is that the above formulas for constructing py- 
thagorean triples were known to Euclid, who gave a proof 
for them in a lemma to Eucl.X.29; five centuries later, Dio- 
phantus is so familiar with them that he has a technical term 
(mA\aocew, “to shape”) for the construction of the triangle 
(2pq, p° — 4°, p’ +q°) from a pair of integers (p, g). Whatever 
may have been the fate of that topic during the Middle 
Ages, it is in Diophantine garb that it emerges again in 
BOMBELLI’s Algebra of 1572, whose Book III is almost 
entirely based on Bombelli’s perusal of a manuscript of 
Diophantus in the Vatican library. The topic is then taken 
up again by VIETE, at first in his Notae priores (copies of 
which circulated long before they were printed in 1631) 
and then in his Zetetica of 1593; both are also largely based 
on Diophantus, whom Viéte could have read in Xylander’s 
Latin translation of 1575 but had more probably studied 
in a Greek manuscript in the royal library in Paris. Viéte’s 
work, which exhibits more originality than Bombelli’s in its 
handling of Diophantine material, will be discussed in greater 
detail further on. 


§VI. 


Whether one is interested only in pythagorean triangles, 
or more generally in sums of two squares, there inevitably 
comes a point where the algebraic identity 


(3) (x? +y)(z? +07) = (xztyt)? + (xtFyz)® 

has to play a role. In particular, because of the double sign 
in the right-hand side, it appears whenever one wishes to 
construct numbers which can be written as the sum of two 


squares in more than one way. For z = t = I, one gets the 
special case 


(4) Q(x? +y*) = (xty)? + (ey), 


which, dressed up in geometric garb, appears in Fucl.I1.9-10. 
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The identity (3) must have been familiar to Diophantus, 
as shown by the following passage (Dioph. 111.19): 

“It is in the nature of 65 that it can be written in two different 
ways as a sum of two squares, viz., as 16 + 49 and as 64 + 1; 
this happens because it is the product of 13 and 5, each of which 
is a sum of two squares.” 

Since Diophantus has several references to a lost book of 
“porisms” (i.e., auxiliary theorems), and since the case y = 
t=1,x —z= +1 of (3), ie., the identity 


(x? + 1)(22 +1) = (xz+ 1)? + 1 
or rather, equivalently: 
x2? tox? + 2 = (xz+1)P 


for x — z = +1, seems implicit in one such reference 
(Dioph.V .5; cf. Dioph.III.15), it is perhaps not too far-fetched 
to imagine that (3) may have been one of his porisms. This 
must have been Bachet’s view when he included (3) as 
Prop.III.7 of his own Porisms in his Diophantus. However 
that may be, while it would seem unreasonable, in view of 
the above-quoted passage, to deny a knowledge of (3) to 
Diophantus, no explicit statement of that identity appears 
to be known until we find one, with an elaborate proof for 
it, in Fibonacci’s Liber Quadratorum of 1225 (Leon.11.257— 
260 = LVE.prop.IV). Fibonacci claims no credit for it, and 
seems rather to treat it as a result, well-known to specialists 
but deserving wider diffusion. 

In contrast with Fibonacci’s more popular Liber Abact, his 
Liber Quadratorum fell into deep oblivion; it was with great 
difficulty that a copy of it was located by prince Boncompagni, 
who published it in 1856. As noted above, Bombelli, in his 
Algebra of 1572, took up many of the problems he had 
copied out of Diophantus; in particular, problem CXXVI 
of his Book IIT is translated from Dioph.II1.19 and includes 
the passage quoted above about 65 being twice a sum of 
two squares; but he has no comment to indicate whether 
he understood what was involved. The decisive step was 
taken by Viéte, who applied (3) to the construction of two 
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new rectangular triangles from two given ones, the operation 
being called by him “synaeresis” or “diaeresis” according to 
the sign used in (3) (cf. Notae priores, prop.XLVI = Op. p. 
34; Zet.1V.2 = Op. p. 62-63). Incidentally, Viéte, far from 
confining himself to pythagorean triples and to integers, is 
well aware that his constructions and identities hold for 
arbitrary quantities. He also does not fail to point out (Notae 
priores, prop.XLVIII) the relation between (3) and the ad- 
dition and subtraction formulas for trigonometric functions, 
a relation which becomes clear to us if we put 


x=rcosa, y=rsine, z=scosB, t=ssinB, 
and hence: 
xz + yt = rscos (a + B), xt + yz = rs sin (a@ ¥ B); 


and he introduces the term of “angle-duplication” for the 
case (x, y) = (z, t). As this shows, Viéte was interested in 
algebra and trigonometry much more than in number theory 
(cf. also his Theoremata ad Angul.Sect. = Op. pp. 287-304). 


§VII. 


A few more words about Fibonacci and his Liber Qua- 
dratorum will not be out of place here. Fibonacci, also known 
as Leonardo Pisano, was a well-traveled, well-informed and 
undoubtedly well-read mathematician. As a member of the 
international merchant community into which he was born, 
he was at home in the Mediterranean world from East to 
West. Born in Pisa, presumably around 1170, he had lived 
in North Africa and in Constantinople and had at least 
visited Sicily, Provence, Syria and Egypt during the latter 
part of the twelfth century, gathering information wherever 
he went from the most learned mathematicians of the day, 
before settling down to an enviable position as mathematical 
expert and teacher in his native city. When the emperor 
Frederic II sojourned there, Leonardo was of course brought 
to him and introduced into the court circle, at that time a 
meeting point for Latin, Arabic and Greek culture; it is 
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hardly fanciful to surmise that Leonardo himself had some 
degree of familiarity with all three languages. In the fashion 
of the times, he held disputations in Frederic’s presence 
with the learned men of the imperial entourage. On one 
occasion he was challenged to find three squares in arithmetic 
progression with the difference 5, i.e., in modern algebraic 
shorthand, to solve 


yore =2-yP=5 
in rational numbers, or, what amounts to the same: 
y? — X? = Z? — y? = 57? 


in integers. This became the object of the Liber Quadratorum. 

Problems about squares in arithmetic progression are quite 
old; because of identity (4), ie., because of Eucl.11.9-10, 
finding such squares is equivalent to finding pythagorean 
triangles. In fact, if X°, Y®, Z’ are in arithmetic progression, 
we have 

Y? = 4X?+Z7) =U +V 

with U = $(X+Z), V = 4(Z—X), and the difference Y* — X? 
= Z? — Y* has the value $(Z?—X”) = 2UV, which is four 
times the area of the triangle (U, V, Y). There is nothing 
to tell us that this was known to Leonardo’s challenger, or 
to Leonardo himself. But is it mere coincidence that the 
problem “Find a pythagorean triangle of area 5m”” occurs 
in a Byzantine manuscript of the eleventh or twelfth century, 
now in the Old Palace library in Istanbul? It has been pub- 
lished by Heiberg (Bibl.Math.(III) 8 (1907-08), p. 122); 
Leonardo could have seen it in Constantinople. The writer 
of that problem knows that the area of a pythagorean triangle 
is a multiple of 6; “therefore”, he says, “we must take for 
ma multiple of 6”; taking m = 6, he writes down, without 
any further explanation, the triangle (9, 40, 41), of area 
180 = 5 X 6°. The corresponding squares in arithmetic 
progression would be 317, 412, 497, which already occur as 
such in Dioph.111.7. 

On the other hand, Leonardo applies to his problem an 
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attractive and apparently original method, which consists 
in treating systematically each integral square n” as the sum 
of the first n odd integers, this being illustrated at each step 
by suitable diagrams. His first main result (Leon.I1.265- 
271 = LVE.prop.XI—XIJ) amounts to saying that Y* — X? 
= Z? — Y® = D has a solution in integers X, Y, Z if and 
only if D is of the form 


D = Ad*pq(p*—q°) = 4d"pq(p + qb) 
where d, p, g are as in (2) above; of course this is four times 
the area of the triangle given by (2). Then he shows that D 
must be a multiple of 24, and finally he solves his problem 
by taking d = 1, p = 5,q = 4, corresponding to D = 5 x 12? 
and to the same squares 31”, 41”, 49” as above. In conclusion, 
he observes that other integers besides 5 have the same 
property (e.g. 7, as appears by taking p = 16, q = 9), but 
that many do not, and he asserts that D can never be a 
square, giving for this a totally inadequate reason; that this 


is indeed so was to be one of Fermat’s major discoveries (cf. 
Chap.II, §X). 


§VILII. 


Just as the identity (3) plays a major role in any discussion 
of sums of two squares, the identities 
(5) (x®— Ny’)? - NE) = (xz+ Nyt)? — N(xt+ yz)? 
(6) (x? + Ny*)(z2+NP) = (xz = Nyt)? + N(xt¥yz)? 
where N is a positive integer, have a similar role to play in 
problems depending upon the quadratic forms X* + NY?. 
Of course, from a modern algebraic standpoint, they do 
not really differ from (3), but this could not have been fully 


realized until the eighteenth century. For us, perhaps the 
easiest way of verifying (5) and (6) is to write 


(x +yVN)(z+tVN) = (xz Nyt) + (xt + yz) VN, 
or the same identity with V—N substituted for VN, and 
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then to multiply both sides by the conjugate quantity obtained 
by substituting — V+N for V +N; this was first pointed out 
by Euler in his Algebra of 1770 (Eu.1-1.422, Art.175). Since 
Euclid devotes the whole of his Book X to the theory of 
quadratic irrationalities, one feels tempted to imagine that 
he, or one of his successors, might have considered some 
such derivation for (5), or at least for special cases of (5). 
Not only is Euclid himself well aware of the relation 


(VrtVs\(Vr- Vs) = 1 5, 
but even the identity 


1 vr Ns 


Vite ros 7-3 


may be regarded as the essential content of prop. 112 of 
that book. Unfortunately, Euclid’s motivation in Book X 
seems to have been the wish to construct a general framework 
for the theory of regular polygons and polyhedra, and not, 
as modern mathematicians would have it, an algebraic theory 
of quadratic fields. So we are left to speculate idly whether, 
in antiquity or later, identities involving square roots may 
not have been used, at least heuristically, in arithmetical 
work. Certain it is that, as late as the eighteenth century, 
Euler and Lagrange found it appropriate to congratulate 
each other on the original idea of introducing imaginary 
square roots into number theory (cf. fra, Chap. III, §XIV). 

One is on somewhat firmer ground in assuming that 
problems of the type of x” — Ny* = +m, for given positive 
integers N and m, must have occurred rather early in Greek 
mathematics, presumably in connection with the problem 
of obtaining good rational approximations for VN when N 
is not a square. Clearly, if x° — Ny? = +m and if x and y 
are large in comparison with m, the ratio of x to y gives a 
good approximation for VN, asis apparent from the identity 


x — 1 x* — Ny® 
N — —__ 
y yx + yVN 
which is a special case of (7), i.e., of Eucl.X.112. Thus, when 
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Eutocius, in his commentary on Archimedes (Arch.I11.234 
and 246), wishes to verify the validity of Archimedes’ ap- 
proximations 265:153 and 1351:780 for V3, he merely 
writes: 


265° — 3 x 153° = —2, 13517 - 3 x 780? = 1. 


Now (5) gives an easy method for constructing such so- 
lutions of x* — Ny? = + mas soonas one knows one of them 
and a solution of p> — Nq? = +1, or, under suitable cir- 
cumstances, = + 2 (cf. infra, 81X). For instance, Archimedes’ 
approximate values for V3 are best explained by assuming 
that he was applying systematically the formula 


(5x+9y)2 — 3 x (3x45y)? = —2 x (x2—3y%, 
A) y 


i.e., the special case N = 3,z = 5,t = 3 of (5) (cf. W. Knorr, 
Arch f. Hist.of Ex.Sc. 15 (1975), pp. 137-138). Here one should 
note that, if x? — 3y? = —2, both x and y must be odd and 
5x + 9y and 3x + 5y are even; the above relation becomes 


then 
a 2 2 


In fact, Archimedes’ approximations for V3 are all obtained 
by starting with 5? — 3 x 3° = —2and applying alternately 
the substitutions . 


+ + 
(x,9) > (3 4 an 2) 


and 
(x, y) — (5x + 9y, 3x + 5y). 


Another example of the same method, presumably of 
earlier date but first attested in the second century A.D. by 
Theon of Smyrna, is given by the so-called “side and diagonal 
numbers”, i.e., the successive solutions of x7 — Qy? =i] 
derived from the obvious one x = y = 1 by iterating the 
substitution 


(x, y) > (xt 2y, x+y). 


Protohistory 17 


The effectiveness of this procedure depends upon the 
identity 


(x+ 2y)? -—-2*x (x+y)? = — (x? — 2Qy"); 


this is the special case N = 2,z = ¢ = 1 of (5), but it is also 
equivalent to (4), i.e., to Eucl.11.9-10. The name given to 
those numbers emphasizes the relationship with the ratio 
V2 of the diagonal of the square to its side. 

We again find at least traces of the above identities in 
Diophantus, where cases y = 1,¢ = l or 2,xt —-z = +1 
of both (5) and (6) seem implicit in Dioph.V.3 and Dioph.V.4; 
in the former passage there is an explicit reference to the 
Porisms, so that we are again left to surmise that those Porisms 
may have included many such identities, perhaps even (5) 
and (6); that they contained quite sophisticated algebraic 
identities is clear from the references in Dioph.V.5 and V.16. 
While this must remain conjectural, it is a fact that we find 
an explicit statement of (5) in the work of the Indian as- 
tronomer and mathematician Brahmagupta in the seventh 
century, of course in connection with problems of the form 
x? — Ny? = +m (Col., p. 363; cf. Datta and Singh, loc.cit., 
vol.II, pp. 146-148). Here, too, the question of a Greek 
origin for (5) may be raised but cannot be answered. 


§IX. 


In 1773, Lessing, a well-known German literary figure 
and librarian of the famous Wolfenbiittel library, published 
a Greek “epigram”, i.e., a short poem (in 22 distichs) which 
he had recently discovered in one of the manuscripts under 
his care. The manuscript describes it as a problem sent by 
Archimedes to the mathematicians in Alexandria. 

Many mathematical epigrams are known. Most of them 
state problems of little depth; not so Lessing’s find; there 
is indeed every reason to accept the attribution to Ar- 
chimedes, and none for putting it in doubt. Only the question 
of its interpretation must remain open. 

We are dealing here (Arch.11.528—534) with a problem 
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in 8 unknown integers; we shall denote them by x, y, z, 4, 
x’, y’,z', t’. The first 8 distichs state, in fairly standard form, 
a set of three equations: 


_{i,2\,_,_(4,4),-, (44 1,2, 
XT Na TBP OV NET BOF NET TP 


this, of course, bespeaks a considerable degree of familiarity 
with systems of linear equations, but otherwise deserves no 
comment. Up to a factor, the solution is 


Xo = 2226, yo = 1602, z% = 1580, th = 891. 


Next we have in 5 distichs the four equations: 
r_- (lit nwo flit 
x= (3 + ror y= (4 + ers), 


roo ] 1 , ro af i 1 
z= (: + aaa )U = ( 6 + Diets ). 
Here things are a little more complicated, but the scholiast 
or his source (of unknown date, of course) is able to obtain 


a solution. Since x’, y’, z’, t’ must be integers, the conclusion 
is that we must have: 


(x, y, Zz, t) =n x 4657 x (Xo; Yo» 205 to) 


with some integer n; x’, y’, z’, t’ can be calculated corre- 
spondingly. In the numerical solution given by the scholiast, 
we have (unnecessarily, of course) n = 80. 

So far this was linear algebra; “if you can solve the problem 
up to this point”, says the author, “no one will call you 
ignorant, but this does not yet make you an expert; come 
and tell me the following”; whereupon he states, in two 
distichs, that x + y has to be a square, and, in two more 
distichs, that z + ¢ has to be a triangular number; “if you 
do that”, he concludes, “you win the prize for supreme 
wisdom”. He may well say so; it can be shown that the smallest 
solution is of the order of magnitude of 101°”. 

In terms of the integer n, in fact, we can write x + y = 
4An, where A turns out to be squarefree, so that we must 
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have n = AU”, where U is an integer. On the other hand, 
if z + fis a triangular number, i.e., of the form 3m(m + 1), 
then 8(z+é) + 1 must be a square V’; as we can putz + ¢ = 
Bn, we get Vv? — NU" = | withN = 8AB. The problem thus 
amounts to a special case of “Pell’s equation”. 

How are we to understand this? At the very least, it must 
have been meant to indicate that Archimedes was interested 
in such equations; perhaps also that he had some insight 
into the structure of the set of solutions; this, in turn, would 
presuppose a knowledge of the identity (5). At best, it could 
mean that he had devised a general method of solution for 
equations of the type x* — Ny? = 1; this would have had 
to depend, explicitly or not, upon the construction of the 
continued fraction for VN, i.e., upon the Euclidean algorithm 
as it applies to such a square root. Tannery seems to have 
favored the latter alternative and to have entertained the 
hope that the solution would some day re-appear with the 
lost books of Diophantus; so far no evidence for this has 
been forthcoming. Equations of the type x* — Ny? = 1 do 
occur in Diophantus (e.g. in Dioph.V.9 and 11), but it is a 
rational solution that is asked for, even when accidentally 
a solution in integers is obtained (e.g. in Dioph.V.9, where 
N is of the form m? + 1, giving y = 2m, x = 2m” + 1). 

With the Indians we are on firm ground again, in spite 
of the huge gaps in our knowledge; some of the main texts 
have been known in the West, in English translation, since 
Colebrooke published his Algebra with Arithmetic and Men- 
suration, from the Sanscrit of Brahmegupta and Bhascara (London 
1817). In the work of Brahmagupta, dating back to the 
seventh century, we find a whole section (Chap. XVIII, 87; 
Col.pp.363—372; cf. Datta and Singh, loc.cit. pp.146—-161) 
devoted to equations Ny? + m = x°, where N is a positive 
integer (tacitly assumed not to be a square), m is a positive 
or negative integer, and a solution (x, y) in integers is asked 
for; one should note that Brahmagupta and his successors 
are quite conversant with positive and negative numbers 
and 0, and that they state in full the rules for handling 
them. Also the technical vocabulary introduced by Brah- 
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magupta in connection with those equations has been 
adopted, with minor variations, by all his successors; the 
type of problem in question is known to them as vargaprakrti 
(“square-natured”); N is the gunaka (“qualifier”) or prakrti 
(“nature”); m is the ksepa (“additive”); y and x are the “first”, 
“minor” or “junior” root, and the “second”, “major” or 
“senior” root, respectively. It is in that section (verses 64— 
65) that we find the identity (5), given in the form of two 
laws of composition 


((x, y3 m), (z, t; n)) > (xz Nyt, xt+ yz; mn), 


where for brevity we write (x, y; m) for any triple of integers 
satisfying x° — Ny* = m, the “qualifier” N remaining the 
same once for all. In the subsequent literature, these laws 
became known as the bhdvana (“production”) rules; the 
bhavand is called positive or negative according as the sign 
is + or —, and equal or unequal according as (x, y) = (z, 4) 
or not. Thus these terms correspond exactly to Viéte’s syn- 
aeresis, diaeresis and angle-duplication (cf. supra, §V1). 

As Brahmagupta explains, the bhavand can then be used 
in various ways in order to derive new solutions from known 
ones; for instance, composition with a triple (p, q; 1) will 
generate an indefinite number of solutions for a given “ad- 
ditive” m when one is known. Similarly; when one has ob- 
tained, e.g. by composing a triple (x, y; m) with itself, a triple 
(X, ¥; M) with M = m’, this gives a rational solution 
(X/m, Y/m) for the equation with the additive 1, and it 
gives a triple (X/m, Y/m; 1) if X/m and Y/m are integers. 
More generally, if M = ym®, one can derive, from any 
triple (X, Y; M), a triple (X/m, Y/m; yw) provided X/m, Y/m 
are integers (Datta and Singh, loc.cit., pp.150-151). 

These remarks, in conjunction with the bhavana rules, 
already enable Brahmagupta to solve Pell’s equation 
x* — Ny? = 1 in a variety of cases (for instance N = 92 
and N = 83), and to give rules for its solution whenever a 
triple (p, q; m) is known with m = —1, m = +2 or 
m= +4. Form = —lorm = +2, it is in fact enough to 
compose (p, q; m) with itself once, the solution being 
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(p+ Ngq", 2pq; 1) resp. (A(p" + Ng"), pgs V); 
similarly, we have the solution 
(3(p" + Nq*), 4pq5 1) 

ifm = +4 and p is even. On the other hand, if m = +4 
and p is odd, one has to compose (p, q; m) twice with itself 
to obtain a triple (P, Q; + 1), then, if the sign is — , to compose 
(P, Q; —1) with itself; for these two cases, Brahmagupta 
gives explicit formulas, with polynomials in p, q of degree 
3 in one case and 6 in the other. 

With this, one is still far short of the general solution; 
since Colebrooke it has been known that such a one is to 
be found in the work of Bhaskara (twelfth century; Col. pp. 
170-184; cf. Datta and Singh, loc.cit. pp.161—172), but an 
almost identical presentation has now been found, attributed 
to an otherwise unknown author Jayadeva, in a commentary 
of the eleventh century (cf. K. S. Shukla, Ganita 5 (1954), 
pp.1~20). Both authors describe the method as “the cyclic 
process” (cakravala, from cakra, a wheel). Its true originator 
remains unknown. 

As is the case with many brilliant discoveries, this one can 
be seen in retrospect as deriving quite naturally from the 
earlier work. For a given N, assume that we have a triple 
(p, q; m) where m, in some sense, is small; one wishes to 
derive from this another such triple. To this end, one con- 
structs a triple (x, y; M) with M = mm’ and m’ small; com- 
posing it with the first one, we get a triple (X, Y; m’m’). 
If now X and Y are multiples of m, this gives us a triple 
(p’, q'; m') with p’ = X/m, q' = Y/m, and we can proceed 
in the same manner, hoping eventually to reach a triple 
(u, v; 1). In the cakravala, this is achieved by taking y = l and 
therefore M = x* — N, where x remains to be suitably cho- 
sen. This gives 


X= px+Nq, Y=prt qx. 
We may assume that q was prime to m; were this not so, the 


relation p* — Ng’ = m would show that p and q have a g.c.d. 
d > 1, and m would be a multiple of d’, so that the triple 
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(p, 3 m) could be replaced by (p/d, g/d; m/d”). We can now, 
by the kuttaka method (cf. supra, §1V), determine x so that 
Y = p + qxisa multiple of m. Writing 


gM — gx" _ Nq@ _ gx _ p +m 
+ 
=m xX (2 2 <ige-p)+1), 


m 


we see that M is a multiple of m, since q is prime to m. Now 
the relation X* = NY* + mM shows that X? is a multiple 
of m*, so that X is a multiple of m. Writing X = mp’, 
Y = mq’, we get the triple (p’, q’; m’). 

To make m’ small, observe that we can choose x, within 
its congruence class modulo m, such that x < VN <x + |m/; 
if then VN + x were <0, we would have 2VN < |m|; if 
therefore we assume |m| < 2\VN, this cannot happen, and 
we get 


0O<N — x? = (VN-x)(VN +x) < 2lm|\/N 


and therefore also |m'| < 2\V/N. Now we can apply the same 
procedure to (p’, q'; m'), obtaining a triple (p”, q”; m”) and 
so on. As the integers m, m', m",. . . are bounded, they must 
repeat themselves; perhaps this was the origin of the name 
given to the “cyclic” method. 

This calls for several observations. Firstly, the facts for 
which we have just given proofs may have been known to 
the Indians only experimentally; there is nothing to indicate 
whether they had proofs for them, or even for part of them. 
Secondly, in order to carry out the cakravala, one must have 
a Starting point; invariably they choose as such the triple 
(po, 1; mo) for which pj is the closest square to N, above or 
below; it is easy to see that |mol is then <2VN. That being 
so, the kuttaka is of course not needed at the first step, since 
go = 1; but it is also not needed in any further operation, 
since, if notations are as above, we have p’ — q’x = — qm’, 
so that at the next step one has to take x’ = —x (mod m’). 
Strangely enough, this does not seem to have been noticed 
by any of our Indian authors (nor even by their later com- 
mentators, down to the sixteenth century); they make no 
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mention of it, and invariably refer to the kuttaka for the 
choice of x, even though their abundant numerical evidence 
could easily have convinced them that this was unnecessary. 
Thirdly, the Indian prescription for choosing x within its 
congruence class modulo m is not quite what we have in- 
dicated above, since the rule is to make N — x® “small” (i.e., 
in actual practice, as small as possible), but, as the context 
shows, in absolute value; in other words, one should substitute 
x, = x + |m| for xifx; — N happens to be less than N — x’. 
It can be shown that this has merely the effect of abbreviating 
the procedure somewhat when that is the case. Finally, we 
are told to iterate the process only till we find an “additive” 
m with one of the values +1, +2, +4, and then to make 
use of the bhdvand, i.e., of Brahmagupta’s procedure for 
that case. Actually this is no more than a shortcut, since it 
can be shown that the cakravala, applied in a straightforward 
manner, would inevitably lead to a triple (p, g; 1) as desired; 
while this shortcut is quite effective from the point of view 
of the numerical solution, it destroys the “cyclic” character 
of the method, which otherwise would appear from the fact 
that the “additives” m, m', m", ... would repeat themselves 
periodically, corresponding to the periodicity of the con- 
tinued fraction of VN (cf. Chap.III, §XI]). 

For the Indians, of course, the effectiveness of the cakravala 
could be no more than an experimental fact, based on their 
treatment of a great many specific cases, some of them of 
considerable complexity and involving (to their delight, no 
doubt) quite large numbers. As we shall see, Fermat was 
the first to perceive the need for a general proof, and La- 
grange the first to publish one. Nevertheless, to have de- 
veloped the cakravala and to have applied it successfully to 
such difficult numerical cases as N = 61 or N = 67 had 
been no mean achievement. 


§X. 


When Fermat embarked upon his career as a number- 
theorist, he had little to go by (apart from Euclid) except 
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Diophantus, in Bachet’s edition, and Viéte’s masterly pre- 
sentation of a large portion of Diophantus in his Zetetica of 
1593. 

As he had later occasion to observe, neither Viéte nor 
even Diophantus were really number-theorists. “Number 
theory”, he wrote in his challenge of 1657 to the English 
mathematicians (Fe.II.335) “deals with integers; Diophantus, 
on the other hand, deals with rational numbers, and Viéte, 
by extending Diophantus’s work to continuous quantities, 
has made it clear that it does not really belong to number 
theory”. Fermat then goes on to assign the work of Viéte 
to “geometry”, and classifies that of Diophantus as “close 
to geometry”. 

From our modern point of view, things look somewhat 
differently. Firstly, since so much of Diophantus, and even 
more of Viéte, remains valid over arbitrary fields, we would 
classify this primarily as algebra; of course Viéte’s algebra, 
both in its notations and in its content, is far more advanced, 
and much closer to ours, than that of Diophantus. Secondly, 
the distinction between rational numbers and integers is 
not as clearcut as the above quotation from Fermat would 
suggest; as he knew very well, it does not apply when one 
is dealing with homogeneous equations. Thirdly, as will be 
seen presently, there is much, in Diophantus and in Viéte’s 
Zetetica, which in our view pertains to algebraic geometry. 
Moreover, modern developments have led to a better un- 
derstanding of the analogies (already dimly perceived on 
some occasions by Leibniz and Euler) between function- 
fields and number-fields, showing that there is sometimes 
little difference between solving a problem in rational num- 
bers and solving it in a field of rational functions. 

All these points of view acquire significance when one 
tries to classify the problems and methods which occur in 
Diophantus. His work, as it has been transmitted to us, 
consists of not quite two hundred problems, each of which 
asks for the solution, in positive rational numbers, of a given 
set of algebraic equations. Diophantus has notations for one 
unknown quantity at a time, and for its powers (with positive 


FRANCOIS VIETE 
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(Dlupres le Prontispice, par Rabel, de !édition de 1640.) 
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or negative exponents), but not for more; if, in the course 
of a problem, an auxiliary problem has to be discussed, the 
same notation is used for the new auxiliary unknown, and 
is then discarded. The data frequently include one or more 
constants; as these cannot be designated by letters, arbitrary 
numerical values have to be assigned to them; moreover, 
the solution is usually expected to contain as many param- 
eters as is compatible with the problem, or at least with the 
method of solution, but a numerical value has to be assigned 
to each at the moment of its appearance; this may be changed 
later if it turns out that it has to fulfil some condition, not 
satisfied by the initial choice (an obvious adaptation from 
the old regula falsi, well-known from its early use in linear 
problems). Occasionally a partial problem is discussed, 
sometimes as a lemma, with a parameter which later must 
play the role of the unknown; such explicitly parametric 
solutions are called “indefinite” (cf. e.g. Dioph.1V.19, where 
this term is defined). Of course Diophantus is aware of the 
existence of irrational quantities; occasionally, when an initial 
choice of some parameters would lead to such a value for 
the unknown, it is discarded by the words “the unknown 
becomes irrational” (cf. e.g. Dioph.1V.9). 

It is not easy to characterize the arrangement of the prob- 
lems into books, or within each book; it is also unnecessary, 
since we cannot be sure that this goes back to Diophantus 
(or whoever was the original author or compiler of our 
collection). As to the method, it consists (as d’Alembert put 
it, with his customary lucidity, in the article DIOPHANTE 
of the Encyclopédie of 1750) in “making such substitutions 
on the unknowns as to reduce the problem to a linear equa- 
tion in one unknown” (“manier tellement les inconnues ou Pin- 
connue, que le quarré et les plus hautes puissances de cette inconnue 
disparaissent de léquation, et quil ne reste que l'inconnue au premier 
degré”; Encyclopédie, tome 1V, p. 1014). Some types of ques- 
tions, however, occur with great frequency and will be ex- 
amined now. It will be best to describe them first in modern 
terms (cf. Chap.II, §§XIV—XV and Appendix IJ). 

Eventually, most problems are reduced to finding a point 
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with positive rational coordinates on an algebraic curve of 
genus 0 or 1, given by one or more equations. As explained 
above, the coefficients in those equations have to be stated 
numerically, but quite often they include parameters, and 
it is understood that the method of solution should give 
values to the unknowns which can be rationally expressed 
in terms of these. Of course it is easy to verify that this is 
sO as soon as some modern algebraic notation, Viéte’s for 
instance, becomes available. 

It is now possible to distinguish two main cases: 

I. Curves of genus 0: the curve must be such that at least 
one rational point on it, in affine or in projective space, is 
obvious or easy to find. There would then be no problem, 
in the eyes of Diophantus, if this “visible” solution was in 
affine space and satisfied all side-conditions (for instance 
inequalities) required for it. If that is not so, then it becomes 
relevant to observe that, on such curves, rational points are 
everywhere dense. This is made explicit by Viete (Zet.IV.5 = 
Op. pp. 65-67) in the special case of the circle X? + Y? = 
a’ +b; Diophantus seems aware of it, but he has no general 
way of expressing it, except by the use of a special word 
(maptodrns, which Fermat, following Xylander and Bachet, 
translates by adaequatio or adaequalitas) to designate his way 
of approximating a given number by a rational solution for 
a given problem (cf. e.g. Dioph.V.11 and 14). One typical 
case of a curve of genus 0, of frequent occurrence in Dio- 
phantus, is given by the plane conic AX? + BX + C = U?, 
where either A, C or (asin the second lemma to Dioph.VI.12) 
A + B + Cisasquare. One puts then U = pX + gif A = p” 
or C = q’; signs have of course to be adjusted according to 
the requirements of each case. 

II. Curves of genus 1: the curve must have at least one 
“visible” rational point and one “visible” rational pair; this 
is so, for instance, for an equation 


AX* + BX? + CX? + DX+E=U"* 


provided A or E is a square (cf. e.g. Dioph.VI.10). More 
often, one meets with a “double equation” P(X) = U’, 
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Q(X) = V®, with P and Q of the form 
P(X) = AX? + BX +C, Q(X) = A'X? + BX + C'. 


If P and Q have a root in common on the projective line, 
e.g.ifA = A’ = OorifC = C’' = 0, this defines a curve of 
genus 0 in the (X, U, V)-space; otherwise it defines a curve 
of genus 1. In Diophantus, it always turns out that P — Q, 
or sometimes more generally m’P — n°Q for suitable choices 
of m and n, has two rational zeros on the projective line; 
this is so e.g. if A = A’ or if C = C’. Then we can write 
P — Q = LM with rational factors L, M, and the quadric 
P — Q = U® — V* has two rational systems of lines 


U+Ve=tU, UFV=t 'M. 


The intersection of one such line with P = U® provides us 
then with a rational pair; if A and A’, or C and C’, are both 
squares, we have even two rational points. The general theory 
of curves of genus | makes it clear how further points may 
be derived from these. Diophantus achieves the same result 


by putting 
U = (tL +t" 'M) 


and then adjusting the value of t so that P = U? has a 
rational solution (cf. e.g. Dioph. 111.17 or IV.23). Also plane 
cubics fall within the scope of similar methods (cf. e.g. 
Dioph.1V.24). It should be noted, however, that Diophantus 
never goes beyond a single application of his process, and 
that this may fail to provide him with a solution satisfying 
required inequalities. We shall have to consider this point 
more closely when discussing Fermat’s treatment of the same 
questions (cf. Chap.II, $XV). 


§XI. 


While the above sketch seems to cover adequately almost 
the whole of the extant work of Diophantus, one also meets 
there with a few puzzling hints, possibly suggesting a concern 
with quite different topics. One problem (Dioph.V.9) asks 
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for a number A = 2a + | to be expressed as x” + y’, with the 
side conditions x* > a, y° > a; the latter may be disregarded, 
in view of Dioph.11.9, or of the fact that rational points on 
a circle are everywhere dense if the circle has one rational 
point (cf. supra, §X). On the other hand, our text of Dioph.V.9, 
which makes no sense at this point and is obviously corrupt, 
specifies a condition for a (which, it says, should not be odd) 
and also seems to make some reference to prime divisors 
of A. Thus it implies at least that a is an integer (while 
ordinarily Diophantus assumes merely that the data are 
rational). In the solution of the problem, a is given the value 
6, giving A = 13 = 2? + 3%, and the solution is x = 7f, 
y = 38%. In Dioph.V.10, we have a similar question with other 
side-conditions and with A = 9. In Dioph.V.11 we have the 
corresponding problem with A = 3a + 1 and the equation 
A =x? + y? + 27, and the text, which at this point is clear 
and unambiguous as it stands, merely prescribes that 
a be not of the form 8n + 2, 1e., that A be not of the form 
8n + 7. To all this must be added the statement, in 
Dioph.V1.14, that 15 is not the sum of two squares and that 
“consequently” the equation 15x” — 36 = y* cannot be solved 
in rational numbers; for these not altogether trivial state- 
ments, no explanation is given. Finally, as Bachet observed, 
the problems Dioph.IV.29 and 30 assume that any given 
integer (or, presumably, rational number) can be written 
as a sum of four squares; not surprisingly from our point 
of view, this is indeed so for the numerical data chosen in 
those problems. 

One is thus left to wonder just how much Diophantus, 
or his predecessors, knew about the decomposition of num- 
bers into sums of 2, 3 and 4 squares; it could be, of course, 
that their knowledge was merely experimental. Soon after 
taking up such questions, Fermat observed that the im- 
possibility of some decompositions can be proved quite easily 
by using congruences modulo 4 or 8; one sees thus, for 
instance, that A cannot be written as x° + y° if it is of the 
form 4n + 3, nor asx* + 9° + 2’ if it is of the form 8n + 7 
(cf. Chap.II, §V). Such arguments are not found anywhere 
in Diophantus, but they cannot be said to be alien to the 
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spirit of Greek arithmetic. Short of a chance discovery of 
hitherto unknown texts, the questions thus raised are 
unanswerable. 


§XII. 


For a mathematician of the sixteenth century, Diophantus 
was no easy text to decipher, and no small share of the 
credit for its re-discovery must go to Rafael BOMBELLI 
and to XYLANDER. Bombelli read and translated most of 
it in Rome, for his own use, about 1570, and then incor- 
porated this into his Italian Algebra of 1572; Xylander was 
the first to attempt a complete translation, and published 
the fruit of his efforts in Basel in 1575. Of course the sub- 
stantial difficulties of such undertakings were greatly in- 
creased by the comparatively poor condition of the available 
manuscript texts. As has been the case with virtually all 
Greek classical authors, these were all derived from a single 
codex (the so-called “archetype”, now lost) marred by copying 
mistakes and omissions. Worst of all, in the case of Dio- 
phantus, were the numerical errors. Undoubtedly the 
copying had been done by professional scribes, not by 
mathematicians; but perhaps this was just as well; would- 
be mathematicians might have made things even worse. 

The first translator, Holzmann, who hellenized his name 
as Xylander, was a humanist and a classical scholar who 
took up algebra as a hobby. Bombelli was a busy engineer, 
designing canals, desiccating marshes, and he dedicated to 
mathematics little more than those periods of leisure which 
were granted to him by his munificent employer and patron, 
the bishop of Melfi. When he inserted his Diophantine ma- 
terial into Book III of his Algebra, his intention seems to 
have been merely to enrich with it an earlier manuscript of 
his, composed at least ten years before. His own merits as 
an algebraist lay elsewhere, for instance in his treatment 
(quite a modern one) of complex numbers in his Book I, 
and in the theory of cubic and biquadratic equations in 
Book II. VIETE, on the other hand, while trained as a 
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lawyer and employed as such for much of his life, came to 
regard himself, and to be regarded by his contemporaries, 
as a mathematician first of all. “Ars nostra” is the word he 
uses for mathematics in the preface of his Isagoge of 1591, 
the first in a series of publications which was to establish 
his fame as the greatest algebraist of his age. “Pure gold” 
is how he describes the discoveries he is about to disclose; 
“not alchemists’ gold”, he adds, “soon to go up in smoke, 
but the true metal, dug out from the mines where dragons 
are standing watch”. Little did it matter to him, when he 
followed up the Isagoge with his Zetetica of 1593, that much 
of the ore for it had been supplied by Diophantus; his pur- 
pose was to demonstrate the power and scope of the new 
algebra; his use of Diophantus is directed wholly towards 
this end. In this, as Fermat rightly observed, whatever little 
number theory was present in Diophantus has been lost. 

Thus it was still a huge task that awaited the future editor, 
translator and commentator of Diophantus, even after all 
the work done by his predecessors. As Fermat’s son Samuel 
expressed it in his preface to the Diophantus of 1670 (echo- 
ing, no doubt, his father’s sentiments, perhaps his very 
words): 

“Bombelli, in his Algebra, was not acting as a translator 
for Diophantus, since he mixed his own problems with those 
of the Greek author; neither was Viéte, who, as he was 
opening up new roads for algebra, was concerned with 
bringing his own inventions into the limelight rather than 
with serving as a torch-bearer for those of Diophantus. Thus 
it took Xylander’s unremitting labors and Bachet’s admirable 
acumen to supply us with the translation and interpretation 
of Diophantus’s great work”. 

Claude Gaspar BACHET, sieur de Méziriac, was a country 
gentleman of independent means, with classical tastes, and 
no mathematician. Somehow he developed an interest for 
mathematical recreations and puzzles of the kind found in 
many epigrams of the Greek Anthology as well as in medieval 
and Renaissance mathematical texts, or nowadays in the 
puzzle columns of our newspapers and magazines. In 1612 
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he published in Lyon a collection of such puzzles under the 
title Problémes plaisants et délectables qui se font par les nombres. 
As this indicates, he was thus led to number theory, and so 
to Diophantus. The latter must have occupied him for several 
years prior to its publication in 1621; after seeing it through 
the press he retired to his country estate, got married, and 
apparently gave up all mathematical activity, except that 
he prepared a second edition of his Problémes of 1612, in- 
corporating into it some of the materials he had intended 
for a treatise on arithmetic which never saw the light (cf. 
supra, §1V). Incidentally, the Problémes won such applause 
that they have enjoyed successive editions even into the 
present century. 

Samuel Fermat’s praise of Bachet was by no means ex- 
cessive. No mere philologist would have made sense out of 
ever so many corrupt passages in the manuscripts; Xylander 
had all too often failed to do so. Bachet never tires of drawing 
attention to the defects of Xylander’s translation and com- 
ments, while naively extolling his own merits. He even ven- 
tures to speak disparagingly of Viéte’s algebraic methods, 
which he neither appreciated nor understood; this did not 
stop him from lifting two Porisms (III.11 and III.14) and 
some important problems about cubes (in his commentary 
to Dioph.IV.2) out of Viete’s Zetetica (Zet.1V.12,15,18,20 = 
Op. pp. 71,73,74,75) without a word of acknowledgment. 
Nevertheless, his is the merit of having provided his suc- 
cessors and notably Fermat with a reliable text of Diophantus 
along with a mathematically sound translation and com- 
mentary. Even his lack of understanding for the new algebra 
may be said to have benefited number theory in the end. 
It led him, no doubt, to overload his commentary and the 
“porisms” with which he prefaced it with a huge ballast of 
ludicrously clumsy proofs. At the same time, since his chief 
interest was number theory and not algebra, he invariably 
laid the emphasis on those aspects of the text which were 
more properly arithmetical, and, prominently among these, 
on all questions regarding the decomposition of integers 
into sums of squares (cf. supra, 8X1). He asked for the con- 
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ditions for an integer to be a sum of two or of three squares; 
he extracted from Diophantus the conjecture that every 
integer is a sum of four squares, and asked for a proof. 

The curtain could rise; the stage was set. Fermat could 
make his entrance. 


mem hat ALD 


Chapter Two 


Fermat and His 
Correspondents 


§I. 


Fermat’s outwardly uneventful life is soon told. He was 
born into a prosperous middle-class family in Beaumont 
de Lomagne, a little town in the south of France, not far 
from Toulouse in the province of Languedoc. His mother 
Claire de LONG belonged to the “noblesse de robe”, which 
means that her family had been ennobled through one or 
more of its members holding a magistrate’s office. He was 
baptized Pierre FERMAT on 20 August 1601. Before 1631 
he spent some time, perhaps some years, in Bordeaux. On 
14 May 1631 he was received into the “parlement”, i.e., the 
provincial High Court of Judicature, in Toulouse, with the 
title of councilor, an office which he held until his death; 
in virtue of it he was thereafter styled Monsieur de Fermat, 
and is so referred to by his contemporaries. On | June 1631 
he was married to Louise de LONG, a distant cousin of his 
mother’s. They had two sons and three daughters; the elder 
son Samuel became a magistrate, and, like his father, a 
councilor in the Toulouse High Court; his brother took 
orders and became a canon in the cathedral church in 
Castres; one daughter got married, two became nuns. Fer- 
mat’s professional life was divided between Toulouse, his 
main residence, and Castres, which was the seat of the 
“Chambre de I’Edit” of the Toulouse High Court; this was 
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the judicial instance concerned with the relations between 
the catholic and protestant communities of the province’. 
Fermat died in Castres on 12 January 1665 during one of 
his turns of office there. 

Obviously Fermat had enjoyed an excellent classical ed- 
ucation; he was well versed in Latin, Greek, Italian and 
Spanish, and generally praised for his skill in writing verse 
in several languages, a skill which he transmitted to his son 
Samuel. He collected manuscripts; his advice was eagerly 
sought on the emendation of Greek texts. With such interests, 
a seventeenth century gentleman might naturally think of 
a journey to Italy; in Fermat’s case the active scientific life 
in that country, and the presence there of such men of 
science as Galileo, Cavalieri, Ricci, Torricelli, should have 
provided an additional attraction. Indeed some of his best 
friends did make the trip and visited the Italian scientists 
on the way: Carcavi in 1634, Beaugrand in 1635, Mersenne 
in 1644 and 1645. There is no indication that Fermat ever 
thought of following their example. In 1636, in one of his 
very first letters to Mersenne (Fe.11.14), he speaks somewhat 
doubtfully of “finding some occasion for spending three or 
four months in Paris”, which would have given him, he says, 
the opportunity for writing up his ideas on geometrical 
topics; but this was not to happen. Late in life, in 1660, he 
expresses to Pascal the earnest wish to meet him; “as his 
health is little better than Pascal’s” this would have to take 
place “somewhere halfway between Clermont and Toulouse” 
(Fe.II.450); Pascal, also for reasons of health (Fe.11.450- 
452), did not undertake the journey, which indeed, in those 
days, might have been a strenuous one for an invalid. In 
the same year, Fermat, hearing that Huygens is in Paris 
(Fe.11.452), assures him that he would go there to meet him 
if only his health would allow him to travel; obviously he 


A number of letters of Fermat’s, touching upon his professional life, 
have been preserved (Fe.II, letters LILLLIV,LV,LIX,LXIV,LXV,LXVI, 
CXI, and Fe.1V.15—22; cf. Fe.111.505). Cf. also the excellent study by 
P. Chabbert, Fermat 4 Castres, Rev.d’Hist.des Sc. 20 (1967), pp. 337-348. 
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hopes that Huygens, with whom he had communicated 
through Carcavi on scientific matters (Fe.I1.322,328,431, 
446), would take the hint and come to visit him; if so, his 
expectation was nut fulfilled. 

Finally Fermat died without ever* having ventured farther 
from home than Bordeaux. As his letters show, his vacations 
were ordinarily spent “a@ la campagne”, i.e., in his country 
house. 

We have no data to tell us when Fermat became interested 
in mathematics, but this must have occurred at the latest 
during his stay in Bordeaux in the late twenties. There he 
met, among others, Etienne d’ ESPAGNET, himself a mag- 
istrate and a magistrate’s son; they soon became intimate 
friends (cf. e.g. Fe.1I.71,105,136). D’Espagnet was some five 
years older than Fermat, was mathematically inclined, and 
one likes to think of him as having been the first to perceive 
Fermat’s genius and having given him the kind of encour- 
agement which few young men can quite do without. He 
had in his possession, and he communicated to Fermat, 
some of Viéte’s unpublished writings (Fe. 11.133); presumably 
he had a more or less complete collection of Viéte’s works, 
which were not easily obtainable at that time; it 1s obvious 
that Fermat studied them closely, early in his career, long 
before they were made generally available by F. van Schoo- 
ten’s publication of Viéte’s Opera Mathematica in 1646. As 
to d’Espagnet, he remained Fermat’s lifelong friend, keeping 
up, not only his personal relations with him, but also his 
keen interest in Fermat's mathematical progress (cf. 
Fe.11.221). In 1646, Mersenne had to get d’Espagnet to 
bring pressure on Fermat in order to “drag him away 
from Bergerac” for a three days’ meeting in Bordeaux 
(Fe.Supp.136, note (1)). 

Viéte had died in Paris in 1603; in the twenties and thirties, 
some of his admirers were concerning themselves with the 


? Or hardly ever; in 1631 he is listed as having a law degree from 
Orléans; but perhaps this could have been obtained im absentaé. 
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diffusion of his methods; their efforts were to culminate 
in the 1646 publication, referred to above. This small group 
included Jean de BEAUGRAND, “mathematician to H. H. 
Gaston d’Orléans” (whatever this may have meant), who 
later was to hold office as Court secretary (“secrétaire du 
Roy”; cf. Fe.1V.22, note (1), 2°); like most of the scientists 
or scientifically interested men in those days, he was in touch 
with the famous Father MERSENNE, no serious mathe- 
matician himself, but who nonetheless collaborated with 
him in annotating a 1631 edition of Vieéte’s Jsagoge and Notae 
Priores; the latter had never been published, and the former 
was out of print. In view of all this, when we find Beaugrand, 
in a 1630 letter to Mersenne (Mers.11.515), mentioning “ces 
Messieurs de Bordeaux”, to whom he had sent a pretty little 
mechanical problem, it does not seem far-fetched to assume 
that this reference included Etienne d’Espagnet; it could well 
have included also Fermat if he was already known by name 
to Beaugrand at that time. Certain it is that, in the years 
that followed, and at least until 1638, a number of com- 
munications on mathematical topics passed between Fermat 
and Beaugrand (cf. Fe.{1.4—5,20,28,72,94,105,106,133). 
During his trip to Italy in 1635, Beaugrand spoke glowingly 
of Fermat to the Italian scientists (cf. Fe. 1].26, note (1)) and 
showed them some of Fermat’s results (cf. Fe.Supp.99), pos- 
sibly (according to contemporary hints) attributing some of 
the credit to himself (cf. Fe.11.26, note (1), Gal.XV1.328,345, 
and Fe.Supp.113,114). 

Beaugrand died in 1640; it is not clear whether he ever 
met Fermat, and there are signs that the latter’s warm regard 
for him (Fe.11.4—5) had cooled before 1640 (cf. Fe.11.111,133, 
and perhaps II.207). A more enduring friendship was the 
one between Fermat and Pierre de CARCAVI, who became 
his colleague at the Toulouse High Court in 1632 and re- 
mained there until 1636. In that year he transferred to 
Paris, where he formed a close acquaintance with the scientific 
circle gathered around Mersenne, Etienne Pascal and Ro- 
berval. Fermat’s scientific correspondence with the members 
of that group begins with a letter to Mersenne a few days 
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after Carcavi’s arrival in Paris. From then on it never stopped 
until around 1662, when Fermat, perhaps for reasons of 
ill health (cf. Fe.1I.450) allowed “his Geometry to fall into 
a deep sleep” (Fe.11.485). Clearly this correspondence, a 
not inconsiderable part of which has been preserved, pro- 
vided Fermat with his main outside incentive for pursuing 
his mathematical work, since the closest he ever came to 
personal contact with a mathematician (apart from a hy- 
pothetical visit from Beaugrand) was the above-mentioned 
three days’ meeting with Mersenne, which could hardly be 
so described. As to Carcavi, who took over, after Mersenne’s 
death in 1648, the latter’s role as an intermediary between 
contemporary men of science, he truthfully introduced 
himself to Huygens, in 1656, as “one not well versed in 
mathematics, but loving it passionately” (Fe.IV.118). Nor 
could Father LALOUVERE, the teacher of mathematics at 
the local Jesuits’ college in Toulouse, have been of much 
use to Fermat; his unhappy excursion into integral calculus 
on the occasion of the famous prize competition of 1658 
on the cycloid (cf. Fe.11.413,430) is gleefully made fun of 
by Pascal (no friend of the Jesuit order) in his Histoire de la 
Roulette of 1658. Nevertheless, we owe to him the one and 
only piece of writing of Fermat’s that he ever allowed to be 
printed, the anonymous dissertation (Fe.1.199—254) on the 
rectification of curves which Lalouvére published in 1660 
as an appendix to his book on the cycloid. In that respect, 
mention must be made also of the eight anonymous pages 
which J. E. Hofmann discovered in 1943, appended to a 
copy of Frenicle’s exceedingly rare pamphlet of 1657 on 
“Pell’s equation” and other topics’; this consists of a letter 
to Frenicle and one to Digby and must have been printed 
for private use without Fermat's knowledge, since apparently 
it was unknown even to his son at the time of the preparation 
of the Varia Opera of 1679. 


3 J. E. Hofmann, Neues tiber Fermats zahlentheoretische Herausfor- 
derungen von 1657, Abh.d.preuss.Akad.d.Wiss. 1943-1944, Nr.9, pp. 41- 
47. 
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The question of the publication of Fermat’s work was 
indeed raised on several occasions during his lifetime. As 
we have seen (Chap.I, §1), he once expressed a firm resolve 
of writing up his number-theoretical discoveries in book- 
form (Fe.1.305, Obs.X VIII). In 1654 he requested Carcavi’s 
and Pascal’s cooperation to a more ambitious plan: he wished 
them to help him prepare the bulk of his mathematical 
work for publication, while Pascal would have been in sole 
charge of the number theory, which Fermat despaired of 
ever writing up in full (Fe.11.299). This came to naught; 
but in 1656 Carcavi mentions the matter to Huygens 
(Fe.1V.120); he does so once more in 1659 (Fe.IV.126), in 
spite of Fermat’s rather discouraging and discouraged con- 
clusion to the summary of his arithmetical discoveries (“mes 
resvertes”, as he calls them) in his communication of August 
1659 (Fe.11.436). Finally nothing was done before his death. 

Actually, in those days, it was not quite a simple matter 
for a mathematician to send a work to the press. For the 
printer to do a tolerable job, he had to be closely supervised 
by the author, or by someone familiar with the author’s 
style and notation; but that was not all. Only too often, once 
the book had come out, did it become the butt of acrimonious 
controversies to which there was no end. Should one wonder, 
then, if Fermat, whenever the question of publication arose, 
insisted on anonymity above all (Fe.1I. 106,299)? At the same 
time, it is clear that he always experienced unusual dif- 
ficulties in writing up his proofs for publication; this awk- 
wardness verged on paralysis when number theory was 
concerned, since there were no models there, ancient or 
modern, for him to follow. 

Fermat never bothered to keep copies of his scientific 
communications to his correspondents (cf. e.g. Fe.11.218). 
Thus, after his death, it fell to the lot of his son Samuel to 
bring together, as best he could, the scattered remnants of 
his father’s writings. He began with the Diophantus of 1670, 
a reprint of Bachet’s Diophantus of 1621 where he inserted 
the full text of the notes jotted down by Fermat in the margins 
of his personal copy. To this volume the Jesuit father Jacques 
de BILLY, a former friend of Bachet’s and a teacher of 
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mathematics in Dijon, contributed a 36-page essay, the Doc- 
trinae Analyticae Inventum Novum, based on a series of letters 
from Fermat (now lost except for an early one of 1659: 
Fe.11.436—438); it describes in some detail Fermat’s methods 
for handling diophantine equations of genus 1 (cf. Chap.1, 
§X, and infra, XV), and naively extolls Fermat’s merits far 
above those of Diophantus, Viéte and Bachet. This was 
followed in 1679 by the volume of Varia Opera, the bulk of 
which consisted of Fermat’s writings on geometry, algebra, 
differential and integral calculus, together with a number 
of letters to and from Mersenne, Roberval, Etienne Pascal, 
Frenicle, Blaise Pascal, Carcavi, Digby, Gassendi. Not a few 
letters, among them some important ones on “numbers”, 
were not included, obviously because their recipients failed 
to send them to Samuel; thus they remained unknown until 
our times. 

Now, with the publication of Fermat’s complete works in 
4 volumes by P. Tannery and Ch. Henry (Paris 1891-1912; 
Supplément by C. de Waard, 1922), to which must be added 
a few fragments in Mersenne’s Correspondance and J. E. 
Hofmann’s publication mentioned above, it is to be pre- 
sumed, barring further discoveries, that we possess the entire 
Fermat corpus, tantalizingly incomplete as it must appear. 


SI. 


In fulfilment of our program, only Fermat’s work on 
“numbers” will be considered here; we shall leave aside, not 
only his extensive writings on geometry and the differential 
and integral calculus, but also his important work on prob- 
abilities (contained in his correspondence with Pascal, 
Fe.11.288-312, and with Huygens, Fe.II.320-—331) and his 
contributions to algebra, viz., elimination-theory (Fe.I.181— 
188) and his treatment of Adriaen van Roomen’s cyclotomic 
equation (Fe.1I.189-194). 

Perhaps binomial coefficients pertain more to algebra 
than to number theory, even though Fermat, describing an 
early result of his about them (Fe.1.341, Obs.XLVI), com- 
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ments that “there can hardly be found a more beautiful or 
a more general theorem about numbers”. The theorem in 
question concerns the integers denoted, in modern notation, 


by 
n+m-1 
m 


and described by Fermat and his contemporaries as “tri- 
angular” (for m = 2), “pyramidal” (for m = 3), “triangu- 
lotriangular” (for m = 4), etc.; they are defined by the re- 
current relation 


(avi) = (a) + (Cees) = ("2") 


Undoubtedly Fermat was aware of the role played by these 
numbers in the binomial formula; he may have known about 
it, if not from Stifel (whom he never read, as he says 
in a different context: Fe.11.188), at least from his study of 
Viéte; cf. e.g. the latter’s Ang.Sect., published in 1615 
(= Op.Math.287—304). Fermat’s theorem is no other than 
the formula 


There is of course no way of dating Fermat’s marginal 
notes in his Diophantus; but this one must surely have an- 
tedated his somewhat complacent mention of the same result 
(as “une tres-belle proposition”) to Roberval in 1636 (Fe. 11.84), 
then again (as “propositionem pulcherrimam”) to Mersenne 
(Fe.11.70; for that letter we adopt the date 1638 proposed 
by J. Itard, Rev.Hist.Sc. 2 (1948), pp. 95-98). This theorem 
was to stand him in good stead in 1654 in his work on 
probabilities; in connection with the latter topic, it was ob- 
tained independently in that same year by Pascal, who, 
strangely enough (since he belonged to Roberval’s circle) 
had not been aware of Fermat’s prior discovery (cf. Fe.11.308, 
and Fe.11.70, note (1)). In the thirties, Fermat used it pri- 
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marily in order to find theorems on sums of the form 


N 
Sm(N) = >) n” 
n=l 


or more generally i'(an + 6)”. Apparently (cf. Fe. 11.84) he 
did this by writing, as above 


N+m)\ _ S n+m—1 
m+] a m 
and then using his theorem to write 


("+m ' = = n(n 1)... (n+m-—1) 


m 


if 
ae + Ayn! +o + Ann) 


with numerical coefficients Aj, ..., A,-1. This gives 
SAN) + AiSp—(N) + + + An-iSi(N) 


1 

= —— N41)... (+m) 
from which one can obtain the formulas for the sums S,,(N) 
by induction on m. Fermat, while noting (Fe.I1.68,84) that 
the case m = 2 had been known to Archimedes and the 
case m = 3 to Bachet, gives the formula for m = 4 asa 
sample, adding that the general case can be treated by the 
same method. The same approach was rediscovered by Jacob 
BERNOULLI (and is described in his posthumous Ars Con- 
jectandi of 1713), leading him to the definition of the “Ber- 
noulli numbers” and “Bernoulli polynomials”, whose im- 
portance for number theory did not begin to appear until 
later at the hands of Euler. As to Fermat, after stating the 
above results to Roberval in 1636, he merely adds (Fe.11.85) 
that he had applied them to his calculation of the integral 
§x”dx (or rather, in his language, to the quadrature of the 
“parabola” y = x”; cf. Fe. 11.73,94,95), in conscious imitation 
of one of the methods which Archimedes had used in the 
case m = 2 (cf. also Fe. 11.83—84, and Fe.1.342, Obs. XLVIID). 
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III. 


In communicating his theorem on binomial coefficients 
to Roberval and to Mersenne, Fermat says nothing about 
the proof. On the other hand, his statement of it in the 
margin of the Diophantus ends up with the words “demon- 
strationem margini inserere nec vacat, nec licet” (“I have no time, 
nor space enough, for writing down the proof in this margin”) 
quite similar to those (“hanc marginis exiguitas non caperet’, 
“the margin is too narrow for this proof”) which conclude 
his notorious statement about the equation X" + Y" = Z". 
This coincidence, as may be noted here in passing, makes 
it plausible that the latter was also made rather early in 
Fermat’s career. At any rate there cannot be much doubt 
about Fermat’s proof for the theorem on binomial coeffi- 
cients; if, as he asserts, he had worked out such a proof, it 
could only have been one by induction. It is of course a 
proof by induction that Pascal] spells out for it, with almost 
pedantic clarity, in the Triangle Arithmétique, and Bernoulli 
does the same in the Ars Conjectandi. Fermat himself has at 
least one proof clearly worked out by induction in his Apol- 
lonius (Lib.I, prop. VII; Fe.1.26~—27), a geometrical treatise 
written largely before 1630 and completed no later than 
1636 (Fe.1I.100). 

In those days, and even much later, the word “induction” 
was used in an altogether different sense; it indicated no 
more than what is now understood by “conjecture”. To 
verify “by induction” a statement P(n), depending upon an 
integer m, meant merely to verify it for some (and sometimes 
very few) initial values of n. Of course it could happen that 
the procedure used, say, in order to verify P(4) was to derive 
it from P(3) in such a manner as to make it obvious that 
the same argument would serve to derive P(n) from P(n — 1). 
In that case, which occurs not seldom in Euclid (e.g. in 
Eucl.1.45, VII.14, VIII.6, to give a few random examples) 
a mathematician has the right to regard such a proof as 
conclusive, even though it is not couched in the conventional 
terms which would be used for it to-day. For such proofs, 
it became customary to use the term of “complete” (or 
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“mathematical”) induction’, while “incomplete” induction 
came to mean the heuristic argument described above. 

Induction of the “incomplete” kind was used quite sys- 
tematically by Wallis in his Arithmetica Infinitorum of 1656 
and applied to a wide variety of questions, including the 
theory of binomial coefficients; eventually his results led to 
Newton’s binomial series, and in the next century to Euler’s 
theory of eulerian integrals and the gamma function. What 
has to be emphasized here, however, is the criticism offered 
by Fermat in 1657 to Wallis’s use of “induction”: 

“One might use this method if the proof of some prop- 
osition were deeply concealed and if, before looking for it, 
one wished first to convince oneself more or less of its truth; 
but one should place only limited confidence in it and apply 
proper caution. Indeed, one could propose such a statement, 
and seek to verify it in such a way, that it would be valid in 
several special cases but nonetheless false and not universally 
true, so that one has to be most circumspect in using it; no 
doubt it can still be of value if applied prudently, but it 
cannot serve to lay the foundations for some branch of 
science, as Mr. Wallis seeks to do, since for such a purpose 
nothing short of a demonstration is admissible” [“On se 
pourrott servir de cette méthode, si la démonstration de ce qua est 
proposé étort bien cachée et qu’auparavant de s'engager a la chercher 
on se voulut assurer & peu pres de la vérité; mais il ne s’y faut fier 
que de bonne sorte et on doit y apporter les précautions nécessaires. 
Car on pourroit proposer telle chose et prendre telle régle pour la 
trouver quelle seroit bonne a plusieurs particuliers et néanmoins 
seroit fausse en effet et non universelle. De sorte qu'il faut étre fort 
circonspect pour s’en servir, quoiqu’en y apportant la diligence 
requise, elle puisse étre fort utile, mais non pas pour prendre pour 
fondement de quelque science ce qu’on en aura déduit, comme fart 
le steur Wallis; car pour cela on ne se doit contenter de rien moins 
que d’une démonstration . . .”: Fe.11.351-—352]. 

Predictably, Wallis was displeased, and, in a letter to Digby 


* In French, “récurrence” (for “complete” induction) avoids the am- 
biguity due to the double meaning of “induction”. 


Fermat and His Correspondents 51 


(Comm.Epist., letter XVI = Wal.11.782—783), haughtily de- 
clined to answer the charge. As his results have largely been 
substantiated by later developments, Fermat’s criticism may, 
in retrospect, strike us as too harsh, even though he sought 
to soften it by some high praise for Wallis (cf. also 
Fe11.337,343). What concerns us here is that it seems to 
express Fermat’s mature views about proofs. Similarly, 
writing to Clerselier in 1662, he states that “the essence of 
a proof is that it compels belief” (“la qualité essentielle d’une 
démonstration est de forcer a crotre”: Fe.11.483). This is not to 
deny, of course, that he was liable to error; he was so, perhaps 
more than others, in view of his fatal habit of seldom writing 
down his proofs in full and hardly ever keeping copies of 
his communications to his friends. Nevertheless, in view of 
the above quotations, when Fermat asserts that he has a 
proof for some statement, such a claim has to be taken 
seriously. 


SIV. 


Man’s love for numbers is perhaps older than number 
theory; perhaps it assisted in its birth; it has ever been its 
faithful companion, not untainted by superstition at times 
(cf. Chap.I, §III). As noted above (Chap.I, §1X) the Indians’ 
love for large numbers may have supplied a motivation for 
their treatment of Pell’s equation. Bachet had been a number- 
lover before turning to Diophantus (cf. Chap.I, §XII). 

Perhaps none of Fermat’s contemporaries deserves to be 
called a number-theorist; but several were number-lovers. 
One of these was Mersenne, who was interested in “perfect” 
and “amicable” numbers; so were P. Bruslart de SAINT- 
MARTIN and A. Jumeau de SAINTE-CROIX, both older 
than Fermat, and B. FRENICLE de BESSY, who was some 
years younger and was to become a member of the first 
Academy of Sciences when it was founded by Colbert in 
1666. Frenicle was once praised by Fermat, not without a 
touch of irony perhaps, as “this genius who, without the 
succor of Algebra, penetrates so deeply into the knowledge 
of numbers” (Fe.1I1.187). 


52 Number Theory Chap.II, §1V 


Early in his career, in his Bordeaux period (cf. Fe.11.196), 
Fermat had been attracted by “magic squares”; these are 
square matrices of positive integers satisfying certain linear 
conditions. Stifel, Cardano, Bachet had mentioned this topic. 
In 1640, Fermat and Frenicle discovered through Mersenne 
their common interest in it; this was the beginning of a 
correspondence between them (cf. Fe.1I.182-—197), which, 
after a somewhat inauspicious beginning (Fe.II.182—185), 
soon turned to more substantial topics and, in spite of 
some occasional friction (cf. Fe.11.260) was continued, rather 
desultorily at times, throughout the next two decades. 

It is difficult for us to take magic squares seriously, in 
spite of Fermat's professed enthusiasm for it (Fe.1I.194) 
and of E. Lucas’s intriguing suggestion (Fe.IV.190) that they 
may have led to the discovery of the fundamental identity 
for sums of four squares (cf. infra, §XIV, and Chap.III, 
§XI). An even more popular topic with the number-lovers 
of those times turned out to be far more fruitful at the 
hands of Fermat; it is the one which grew out of Euclid’s 
theorem on “perfect numbers”, and which was invariably 
referred to, in Fermat’s time, under the rather unsuitable 
name of “aliquot parts”. Somehow the term “aliquot part”, 
of uncertain origin, came to denote any divisor, including 
I but excluding n, of an integer n, and the plural “the aliquot 
parts” to denote the sum s(n) of such divisors. Thus “perfect” 
numbers are those which satisfy s(n) = n, and “amicable” 
numbers are the pairs n, m satisfying s(n) = m, s(m) = n. 
In addition to these, Mersenne and others started looking 
for “submultiple” numbers, 1.e., solutions of s(n) = a.n, where 
a is 2, 3, or some other small integer. Even DESCARTES, 
around 1638, was drawn into the search for such numbers, 
and, with his usual arrogance, soon became the self-pro- 
claimed champion at this game, while at the same time as- 
suring Frenicle (a rival contender for the title) that it was 
less than a year since he had even heard of such matters. 
It was a basic tenet of Cartesian philosophy that any man, 
by dint of proper self-discipline and by proper use of his rea- 
soning faculty, could construct within his own mind the 
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whole system of everything there is to know about the ma- 
terial and immaterial world. The emphasis being on the 
word proper, it turned out, of course, that Descartes, and 
only he, had ever been capable of such a feat, and conse- 
quently that others could well learn from him, but he had 
learnt nothing from others and never had had to. He stoutly 
denied, for instance, that his algebra owed anything to Viete, 
directly or indirectly, and, however unlikely such assertions 
may appear, there is no way of checking their degree of 
veracity. 

Actually, not a little ingenuity is required in order to 
obtain all the “perfect”, “amicable” and “submultiple” num- 
bers which turn up in the letters exchanged at that time 
between Fermat, Mersenne, Frenicle and Descartes. Fermat 
had started considering such questions long before 1636 
(Fe.11.20); in 1636 he pronounced them to be very difficult, 
thought he had a general method for solving them all, and 
told Roberval that he was planning a small treatise on the 
subject (Fe.11.93); in 1643 he still speaks complacently of 
“excellent discoveries” about aliquot parts, but his interest 
in them seems to be waning (Fe. 11.255); already by that time 
diophantine problems and sums of squares occupy the 
foreground. 

To gauge the difficulty of the problem for some of Fermat's 
contemporaries, one must remember that, as late as 1640 
(Fe.IV.69), Mersenne was asking Saint-Martin how to find 
the number of divisors of 49000 and their sum without 
enumerating them one by one; Euclid, and perhaps even 
the mathematicians around Plato (cf. Chap.I, 811) might 
have found this a naive question. Of course a good algebraic 
notation is a great help in such matters; with such notations 
as Descartes and Fermat had at their disposal, it is almost 
trivial to verify that, if p is a prime, then s(p") = (p"— 1)/(p— 1) 
for all n (a result which, in substance, is contained in 
Eucl.1X.35). If, for our convenience, we denote by S(n) the 
sum of the divisors of n including n, 1.e., S(n) = s(n) + n, 
then it is equally easy to show that S(mn) = S(m)S(n) when 
m,n are mutually prime. Those two facts are at the bottom 
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of Euclid’s proof for his theorem on perfect numbers: 
2" 1(2"— 1) is perfect if the second factor is a prime’. In 
the same way one may verify the rule for amicable numbers, 
sent by Descartes to Mersenne in 1638: 2”*'(18 x 2?”— 1) 
and 2”*'(3 x 2"—1)(6x2"—1) are amicable if the second 
factor of the former, and the second and third factors of 
the latter, are primes (Desc.11.93—94); undoubtedly this rule 
must have been known also to Fermat, and it was in fact 
much older (cf. Fe.1V.185). To quote just another example 
among those constructed at that time, here is one for which 
s(n) = 5n, split into its prime factors: 


n= 2° x 38 x 5> x 77 x I] x 137 x 19 x 31? x 43 
x 61 x 83 x 223 x 331 x 379 x 601 x 757 
x 1201 x 7019 x 112303 x 898423 x 616318177. 


It was proposed by Frenicle to Fermat, through Mersenne, 
in 1643 (Fe.11.255) and, from Fermat’s response, it appears 
that he must have known of it already. As can readily be 
seen, the whole point, in constructing such numbers, lies 
in ascertaining whether some large numbers, particularly 
such of the form 2” — 1, 3" — 1, are prime, and, if not, 
splitting them into their prime factors. For instance, in 1640, 
Frenicle asked Fermat, also through Mersenne, for a perfect 
number between 10°° and 10”* (Fe.1I.185); implicitly the 
question concerned “euclidean” perfect numbers, 1.e., those 
of the form 2”~ '(2”~ 1), since no others were known at the 
time (nor indeed now). The inequality 


107° < gr lon — 1) < 1072 


gives 34 < n < 37, and 2” — | cannot be a prime unless n 
is so, since 2% — 1 divides 2” — 1 for alla and b. Therefore 
the point in Frenicle’s question was really to find out whether 
2°” _ lisa prime. It is not, and there is no perfect number 
between 10° and 10” (or at any rate no “euclidean” one), 
as Fermat answers soon enough (Fe.II.194); fortunately for 


° That these are the only even perfect numbers was proved by Euler; 
the proof is not difficult and may be recommended to the reader as an 
exercise. 
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his self-esteem, he had detected, just in time, a numerical 
error in his calculations, which had nearly made him fall 
into the trap carefully laid for him by Frenicle (Fe.11.199). 
What he found was the decomposition 


2°7 — 1] = 137438953471 = 223 x 616318177. 


If, at the same time, one notices the prime factor 
decompositions 


616318177 + 1 =2 x 7° x 898423, 
898423 + 1 = 2° x 112303, 
112303 + 1 = 2* x 7019, 
7—~1=2°x 3x 5? x 1201, 

39 ~ | = 2 x 13 x 757, 


2 66 


one has all that is needed in order to check Frenicle’s “sub- 
multiple” number quoted above. 

What is far more interesting for us is the way in which 
Fermat proceeded in order to factorize 2°’ — 1. His pro- 
cedure, as he told Mersenne about June 1640 (Fe.11.198), 
depended upon the following three propositions: 


(I) If n is not a prime, 2” — |] is not a prime. 
(II) If is a prime, 2” — 2 is a multiple of 2n. 
(III) If n is a prime, and is a prime divisor of 2" — 1, 
then p — 1 isa multiple of n. 


As mentioned above, (1) is trivial in the sense that it is 
the special case x = 2° of the identity 


x? — 1 = (x—1)P4+x? 2 t $1), 


That Fermat should mention this as a discovery merely 
shows how primitive his algebraic knowledge was at that 
time. As to (II) and (III), they are of course typical cases 
of what is known now as “Fermat’s theorem”. For a general 
formulation, we have to take up his letter to Frenicle of 18 
October 1640, where it is stated as “la proposition fondamentale 
des parties aliquotes”; it is as follows (Fe.11.209): 
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“Given any prime p, and any geometric progression 1, a, 
a’, etc., p must divide some number a” — 1 for which n 
divides p — 1; if then N is any multiple of the smallest n 
for which this is so, p divides also a¥ — 1.” 

Actually it is not hard to discover this “by induction’, i.e., 
experimentally, for instance in the case a = 2, the decisive 
one for perfect numbers. If Frenicle knew of it, which seems 
to have been the case, one may assume that he had proceeded 
in that manner; so did Leibniz, in the years between 1676 
and 1680°; so did Euler in 1731 and subsequent years: “ten- 
tando comprobari potest”, “its truth can be verified by exper- 
iment”, he writes in his very first paper on number theory 
(Eu.1-2.1 in E26{1732; cf. Chap.1L1, §IV). One can hardly 
doubt that Fermat had arrived at it in the same way; but 
already in 1640 he says he has a proof; “he would send it 
to Frenicle”, he writes, “if he did not fear being too long”. 
What could this proof be? 

There are two classical formulations for Fermat’s theorem, 
and correspondingly two proofs for it. Writing it as a? = a 
(mod p), as Fermat did for a = 2 in his letter to Mersenne, 
one proves it additively, writing, for a@ = 2: 


srearyr=1+(f)+(Z)+~+(,2,) +1 


and observing that all the binomial coefficients in the right- 
hand side are multiples of p; this was indeed contained in 
what Fermat had proved about binomial coefficients, not 
later than 1636 (cf. above, §II). The same result can then 
be extended to a > 2, either by “mathematical induction”, 
writing 


(a+1)? = a? + (?)ar- foe (,?)¢ +] 
=a’+1 (mod) 


as Euler did in 1742, or else by using the “multinomial 


® On Leibniz’s unpublished excursions into number theory, cf. the 
excellent article by D. Mahnke, Bibl.Math. (3) 13 (1912-13), 26-61. 
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formula” 


(4+lte4)?=1ltleent14+ > fi 
gtrtts=p gir! ... s! 
as Leibniz did (cf. D. Mahnke, loc.czt., p. 49). 

On the other hand, if one writes the theorem as Fermat 
did in his letter to Frenicle, then it is a special case of the 
fact that the order of a group is a multiple of the order of 
any subgroup. One must then firstly observe that, if one 
divides by a prime p the numbers in the geometric pro- 
gression I, a, a’, ..., with a prime to p, the remainders must 
repeat themselves, so that, for some r 2 0 and some n > 0, 
one must have a"*’ = a’, ie., a’(a"—1) = 0 (mod p); this 
gives a" = | (mod p). Take for n the smallest number for 
which this is so; then one can arrange the p — 1 congruence 
classes prime to p modulo p into sets {b, ba, ..., ba"~ '}, each 
consisting of n elements; as these sets are easily seen to be 
disjoint, n must divide p — 1. This is the multiplicative proof 
obtained by Euler around 1750, of which he says that it is 
the better one since it can be generalized to a” = 1 (mod m) 
for any m (cf. Chap. III, §VI). In view of the way in which 
Fermat changed the formulation of his theorem from (II) 
to (III) and then to his statement to Frenicle, one may guess 
that his final proof was the second one. 

As to the use to which this can be put, Fermat explains 
it quite clearly. Take for instance 2°’ ~ 1 as above. If it has 
a prime divisor p, then 37 must divide p — 1; as p is odd, 
it must be sought among the primes of the form 74n + 1; 
after rejecting the first candidate 149, one tries the next 
one 223 and succeeds (Fe.I1.199). 

Fermat’s enquiry does not stop there. Factorizing numbers 
of the form a” — 1, one notices at once, for n = 2m, that 
one has also to factorize the numbers a” + 1; so Fermat 
asks whether, for any a and any prime p, there always is m 
such that p divides a” + 1. The answer, of course, is negative, 
and Fermat has the criterion for this to happen: if, and 
only if, the smallest n for which p divides a” — 1 is even, 
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then there is m such that p divides a” + 1, and the smallest 
such m is n/2 (Fe.11.209~210). 

For a = 2, another question arises for Fermat, also an 
important one from the point of view of “aliquot parts”: 
when is 2” + 1 a prime? It cannot be so if m has an odd 
divisor d > 1; for, if m = ed, then, putting N = 2°, one sees 
at once that N + I divides 2” + 1 = N%+ 1. The remaining 
case ism = 2’; then 2” + | is indeed a prime for r = 0, I, 
2, 3, 4. Writing to Frenicle in 1640, Fermat enumerates 
such numbers up to r = 6: 


3, 5, 17, 257, 65537, 4294967297, 1844674407370955 1617, 


and conjectures that all are primes. It is hard to believe that 
he did not try to apply, at least to the sixth one 2° + 1, the 
method he had used to factorize 2°” — 1; it shows that any 
prime divisor of 2° + 1 is of the form 64n + 1, which 
leaves the candidates 193, 257, 449, 577, 641, etc.; 641 divides 
2°? + 1; in fact, this is how Euler proceeded, almost a century 
later, incidentally re-discovering Fermat’s theorem on that 
occasion (cf. Chap.III, §1V). What is even more surprising 
is that Frenicle, who had also factorized 2°” — 1, did not at 
once point out the error, as (judging from the general tone 
of their correspondence; cf. Fe.11.185) he would have been 
only too pleased to do; on the contrary, Frenicle expressed 
agreement (Fe.11.208). Fermat persisted in his conjecture 
to the end of his days, usually adding that he had no full 
proof for it (cf. Fe.11.309-310, where he mentions that it 
is important in the matter of “aliquot parts”; cf. also 
Fe.11.405,434). One may imagine that, when he first con- 
ceived the conjecture, he was so carried away by his enthu- 
siasm that he made a numerical error, and then never 
checked his calculation again. As to 2% + 1, it has the prime 
factor 274177, but this was undoubtedly beyond Fermat’s 
range, and even beyond Frenicle’s, though the latter was 
the more stubborn calculator of the two. Even to verify that 
616318177 is a prime must have sorely tried Fermat’s pa- 
tience, if indeed he took the trouble to do it. 
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§V. 


In the years from 1636 to 1640, Fermat turned his at- 
tention increasingly towards two types of questions: dio- 
phantine problems on the one hand, and sums of squares 
on the other. 

Also the latter, of course, were suggested by Diophantus, 
or at least by Bachet’s comments on Diophantus (cf. Chap.I, 
§XI). We see them appear in 1636, at the beginning of the 
correspondence between Fermat and Mersenne. One of 
the first questions to occur in those letters amounts to asking 
whether an integer which is the sum of 2 (resp. 3) rational 
squares is also the sum of 2 (resp. 3) integral squares; as 
previously observed (Chap.1, §XI), Diophantus sometimes 
seems to take this for granted, and so does Bachet in his 
commentary (cf. e.g. his comment on Dioph.V.9 = V.12, 
and on Dioph.V.11 = V.14,). Fermat’s letter to Mersenne 
of 15 July 1636 (Fe.I].29—30) implies that on that date he 
thinks he has proved it, while on 2 September 1636 he 
merely says that he is working at it (“c'est @ quoy je travaille”; 
Fe.11.58). An elementary proof for this has been published by 
L. Aubry in 1912 (cf. Chap. III, App.ID; it is one that Fermat 
could have understood. It is idle, of course, to ask whether he 
could have found it; had he done so, occasions would not have 
been lacking for him to mention it to his correspondents; but 
the matter never turns up again in his letters. 

Obviously, in those years, Fermat was still a raw novice, 
so far as “numbers” were concerned. On 16 September 
1636, he submits to Roberval a problem formulated in the 
terminology of the tenth book of Euclid; it amounts to 
showing that 

x + y? = Wety)z + 2 
has no non-trivial solution in rational numbers. On that 
day, he says he cannot prove it (Fe.1I.62—63). A few days 
later, on 22 September 1636, he has found a proof, but has 


had to work very hard on it (“elle m’a donné grandissime peine”; 
Fe.11.74). His proof, which also Roberval had found in the 
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meanwhile (Fe.11.81,83), is to putt = z + x + y, to rewrite 
the equation as 


Qn? + Qxy + 2y? =f 


and then observe that, taking x, y, ¢ to be integers without 
a common divisor, ¢ must be even, therefore x and y cannot 
both be even, so that x? + xy + y° must be odd. In other 
words, the equation has no non-trivial solution even as a 
congruence modulo 4. One could also observe that the orig- 
inal equation can be written as 


3z7 = (x—-2)? + (y—z)? 


so that the question is to show that 3, which is not a sum of 
two squares in integers, is also not the sum of two rational 
squares; this is a special case of the question mentioned to 
Mersenne a short time before. It could equally well be 
answered by considering the congruence 3z” = u? + v? 
(mod 3). 

Actually, in virtue of “Hasse’s principle” (cf. Chap.IV, 
Appendix J), if F is any indefinite quadratic form with in- 
tegral coefficients, and if F(x) = 0 has no non-trivial solution 
in integers (or, what amounts to the same, in rational num- 
bers), then there is a modulus m such that the congruence 
F(x) = 0 (mod m) has no non-trivial solution. In other words, 
in the case of a single homogeneous quadratic equation, an 
impossibility of this kind can always be verified by a con- 
gruence argument; this applies in particular to the question 
of representing a number by a sum of squares. It is no 
wonder, therefore, that Fermat, at the beginning of his 
career as a number-theorist, should have started with prob- 
lems of that kind. That he did not yet regard congruence 
arguments as essentially trivial only shows his lack of ex- 
perience at that time; on one occasion this must have caused 
him some embarrassment. In 1638, writing to Mersenne 
(Fe.11.66), he announces inter alia, with some self-satisfaction, 
that no integer of the form 8n — 1 can be a sum of less 
than four squares, not only in integers, but even in fractions; 
beside this, he apparently had stated in an earlier letter the 
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similar fact about numbers of the form 4n — I and sums 
of two squares. Mersenne, as was his wont, passed this on 
to Descartes, who at that time thought he had reasons for 
being displeased with Fermat. So far as integral squares are 
concerned, Bachet had already (loc.cit.) given proofs based 
on nothing more than congruences modulo 8 resp. modulo 
4. That this can easily be extended to fractions was dis- 
dainfully pointed out by Descartes; pretending that such a 
trivial matter was beneath his notice, he even left it to be 
carried out by an allegedly unschooled young man, actually 
his former pupil Jean Gillot (Desc. 11.179,195; cf. ibid 11.9 1- 
93, and F. Rabelais, Pantagruel, Livre II, Chap.XVIII). 

It is striking that, in the letter of 1638 to Mersenne, Fer- 
mat’s statement about numbers of the form 8n — 1, with 
which he was unwittingly laying himself open to Descartes’s 
sarcasms, follows immediately the celebrated statement about 
every integer being “the sum of three triangular numbers, 
of four squares, of five pentagonal numbers, and so on”; 
it is the same statement which, perhaps at the same time, 
he inscribed in his Diophantus (Fe.1.305, Obs.X VIII); he was 
to repeat it to Pascal in 1654 (Fe.II.313) and to Digby in 
1658 (Fe. 11.404). Also in his letter of 1638, Fermat challenges 
Sainte-Croix to find two cubes whose sum is a cube, or two 
fourth powers whose sum is a fourth power, with the im- 
plication that he already knew or suspected that there are 
none. We shall never know for sure when, or whether, Fer- 
mat proved all these results. But clearly, if he was still a 
novice in number theory in 1638, his genius had begun to 
shine forth. 


§VI. 


Leaving sums of squares aside for a moment, we turn to 
what has been known since Euler as the theory of quadratic 
residues. 

As we have seen, Fermat’s theorem arose from the wish 
to find out when a prime p divides a number a” ~ 1; a 
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companion to that question, which turns up repeatedly in 
his correspondence with Frenicle in 1640 and 1641, is to 
find out when p divides a” + 1 (Fe.11.209-210,220,226,232). 
In modern terms, call n,(a) the order of a modulo , ie., 
the smallest n > 0 such that p divides a" — 1. By Fermat’s 
criterion (cf. above, §IV), p divides some integer a” + 1 if 
and only if n,(a) is even. On the other hand, as has been 
known since Euler, a is a quadratic residue modulo p if and 
only if (p—1)/n,(a) is even; in view of the law of quadratic 
reciprocity, the primes p with that property are those which 
occur in certain arithmetic progressions of the form 4ax + 6; 
as Euler found out, this can rather easily be discovered “by 
induction” (cf. Chap.III, §VIII). If one considers only primes 
of the form 4n — 1, then n,(a) is even when (p— 1)/n,(a) is 
odd, and vice versa; thus, for such primes, there is a simple 
criterion in terms of quadratic residues for p to have the 
property sought for by Fermat; for the others, there is 
nothing of the kind. Small wonder, then, that Fermat judged 
the problem a very hard one (Fe.11.210)! Characteristically 
perhaps, when he tried to explore it experimentally, he 
immediately made trivial numerical errors which nullified 
his attempt. In a letter sent to Mersenne in 1641 and intended 
for Frenicle (Fe.11.220—221), he asserts (A) that primes of 
the form 12n + 1 divide no integer of the form 3” + 1; 
(B) that every prime 12n + 5 divides some 3” + 1; (C) that 
primes 10n + I divide no 5” + 1; (D) that every prime 10n 
+ 3 divides some 5” + 1; he specifies 11, 13, 23, 37 as 
examples for (A), 5, 17, 19 for (B), 11, 19 for (C) and 7, 
13, 17 for (D). He even promises “for another occasion” 
the general rules for the prime divisors of a” + 1 for any 
a. Actually the primes of type (A) resp. (B) are those for 
which 3 is a quadratic residue resp. non-residue; those of 
type (C) resp. (D) are those for which 5 is a quadratic residue 
resp. non-residue; Fermat’s assertions are true for primes 
of the form 4n — 1, but not necessarily for the others; in 
fact, 37 is a counterexample for (A), since it divides 3° + 1; 
also, 41 divides 5'° + 1 and provides a counterexample 
for (C). Thus Fermat missed his chance of guessing the 
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quadratic reciprocity law; as we know from hindsight, he 
had asked the wrong question. 

It was easier, no doubt, to deal with the quadratic character 
of — 1, although evcn this, at that time, had to be regarded 
as a major discovery. Writing to Roberval in 1640, Fermat 
asserts (Fe.11.204) that, if @ and 6 are mutually prime, no 
prime divisor of a* + 6? can be of the form 4n — 1. Alter- 
natively, if an integer has a prime divisor of the form 
4n — | and has no square divisor, then it cannot be a sum 
of two squares (“not even in fractions”). Writing to Frenicle 
a little later (Fe.II.210), he has a rather cryptic statement 
which, when taken apart, is a direct consequence of the 
theorem he had sent to Roberval; neither Frenicle nor Fer- 
mat, at that stage, seemed willing to disclose their best results 
to one another. When in the same letter Fermat tells Frenicle 
that a prime of the form x? + 2 cannot divide a number 
a” — 2 (Fe.¥1.211), one is left to wonder whether he is 
just testing Frenicle’s knowledge or whether he is being 
deliberately misleading; such a prime, in fact, would 
obviously be of the form 4n — 1 and would divide a’ + x? = 
(a®—2) + (x? +2). 

As to Fermat’s proof, it must have been the same which 
Euler was to discover in 1742 (cf. Chap.III, §Vb). Assume 
that p = 4n — Lisa prime and divides a* + 5’, witha prime 
to b; both a and } must then be prime to p. Using modern 


notation for brevity, we can write a? = —b* (mod p); putting 
m= 2n— 1,p — 1 = 2m, we get a” = — 6°" (mod fp). But, 
by Fermat’s theorem, we have a*” = 6°” = 1 (mod p); this 


is a contradiction. 


§VIL. 


We are so used to the fact that the congruence classes 
modulo a prime p make up a field, viz., the so-called prime 
field F,, and to all the algebraic properties that this implies, 
that we do not feel the need to analyze it any more. Actually 
Bachet’s solution of the equation ax — my = 1 (cf. Chap.I, 
§IV) implies that every integer a prime to m has an inverse 
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modulo m; for m prime this shows that F,, 1s a field. The 
same fact may also be regarded as a consequence of Fermat’s 
theorem, since the congruence a" = | (mod p) implies that 
a has the inverse a”~' modulo p. From this we deduce at 
once, for instance, that if p divides a? + 1°, with a and 6b 
prime to p, then —1 is a quadratic residue modulo p; in 
fact, if a' is an inverse of a modulo p, the congruence = 
~ 1 (mod p) has the solution x = a’b. Thus the result sent 
by Fermat to Roberval amounts in modern language to the 
determination of the quadratic residue character of — 1 
modulo primes of the form 4n — 1. At some time Fermat 
must have become aware of this; some of his later results 
could hardly be explained otherwise (cf. infra, X11). 

The case p = 4n + 1 must have given him more trouble, 
just as it gave more trouble to Euler and gives more trouble 
to the historian attempting to unravel the mysteries of Fer- 
mat’s number theory. Actually, not only does every prime 
p = 4n + 1 divide some sum a’ + 6°, but every prime of 
that form can itself be written as a? + 67, in one and only 
one way. The first to publish a proof for these facts was 
Euler; they are both stated by Fermat, as early as 1640 
(Fe.11.213; cf. Fe.1.314, Obs.XXVI, and Fe.1.293, Obs. VII). 
In all known proofs, one has to begin by proving the first 
part, 1.e., by showing that — 1 is a quadratic residue of p = 
4n + 1; this must also have been the case with Fermat’s 
proof (cf. Fe.11.432, and infra, §VIII). If we consult Euler 
again (Corr.1.494; cf. Chap.III, §VI) we see that Fermat 
could have proceeded as follows. 

Take a prime p = 4n + 1, and, for x, y prime to p, put 
a = x",b = y". We have 

(a? —b*)(a? +b?) = x” — y™. 
By Fermat’s theorem this is a multiple of p, so that p divides 
either a? — b? = x*" — y’" ora’? + b*. Thus, if p divides no 
number a” + 0? with a, b prime to p, it must divide all 
numbers x?” — y?"; for y = I, this gives x°” = I (mod p) for 
l<x<p-l. 

As we know now, a congruence of degree d cannot have 
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more than d solutions in the field F,; in particular, the con- 
gruence x” = 1 (mod p) cannot have p — 1 = 4m solutions. 
For us, this simple remark is enough to clinch the matter. 
For the sake of convenient reference, we formulate as a 


lemma the relevant portion of the result in question: 


Lemma 1. Let p be a prime; then there is nom < p — 1 such 
that a” — 1 1s a multiple of p for all integers a prime to p. 


Did Fermat take this for granted, perhaps on the basis 
of a certain amount of numerical evidence? Of course he 
knew that an equation f(x) = 0 of degree m can have no 
more than m solutions; this is proved for instance in Descartes’ 
Géométrie of 1636 (Desc. V1.444—445), the proof being the 
one that we would give for it now: if f(x) = 0 has the roots 
Xi, XQ, «., X,, One can, by successive division of f(x) by 
X — Xj, X — Xg, ..,X — X, bring down the degree to m — n; 
here it is understood that the underlying field is the field 
of real numbers. In 1772, Euler realized that this is equally 
valid for the field F, (cf. Chap.III, §V1); the steps involved 
are quite easy. But may we credit Fermat, in 1640, with a 
degree of abstract thinking that Euler reached only towards 
the end of his long career? Probably not. 

In 1749 Euler had proceeded differently (loc.cit.); as a 
true disciple of Leibniz and Bernoulli, he used the difference 
operator D defined by 


(Df)(x) = flx+1) — fix); 

it is easily seen by induction on m that, for any m, D"f(x) is 
a linear combination of f(x), f(x + 1), ..., (x +m) with integral 
coefficients. Thus, if, for some m < p — 1, p divides a” — 1 
for | <a < p — 1, it must also divide D"f(1) for f(x) = 
x” — 1; but, as Euler knew, and as is easily seen by induction 
on m, we have D”f = m!; as pis a prime, andm <p — 1, 
this is not a multiple of p. 

Another proof for the quadratic character of — 1 was 
offered by Lagrange in 1771 (Lag.III.425—438), depending, 
not upon lemma 1, but upon “Wilson’s theorem”, i.e., upon 
the congruence (p— 1)! = —1 (mod p) for any prime p. If 
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that congruence is known or assumed, one merely has to 
write, for p = 2m + 1: 


(p—I)l = (1 x2Xx--xm).(p— I\(p— 2) (p-m) 
= (m!)(—1)"m! = (—1)"(ml)®_ (mod fp) 


to conclude that, if m is even, p divides (m!)? + 1. It is by 
no means impossible that Fermat could have, experimentally 
at least, come across the conclusion of Wilson’s theorem; 
Leibniz very nearly did so around 1682 (cf. D. Mahnke, 
loc.cit., p. 42). Is it fanciful to imagine that Fermat could 
then have proceeded in the manner outlined above? 

We know so much more about Leibniz than about Fermat, 
because apparently Leibniz jotted down every thought that 
ever came into his head and preserved every scrap of paper 
that bore his handwriting (cf. D. Mahnke, loc.cit., 
p. 29-30). Such were not Fermat’s habits. 


§VIII. 


At this point, if we allowed ourselves the use of modern 
concepts, we would introduce the Gaussian ring Z[7]; this 
consists of the “complex integers” a + bi = a +bV~1, 
where a and 0 are ordinary integers; as will be explained 
in Appendix I, one has in this ring a “euclidean algorithm”, 
from which it follows that every number in it can be uniquely 
decomposed into prime factors. Then, knowing that every 
ordinary prime p = 4n + 1| divides some sum a? + BD? = 
(a+ bi)(a — bi), while no prime p = 4n — 1 does, one concludes 
that the latter remain prime in Z{:], while the former do 
not; if then, for p = 4n + 1, r + sc is a prime factor of p, 
p has to be equal to r? + s”. 

Bombelli, in Book I of his Algebra of 1572, had given an 
exposition of the theory of the numbers a + bi, perfectly 
sound even from a modern point of view (cf. above, Chap.I, 
§XI1); but there is no sign that Fermat had studied Bombelli, 
and “imaginary numbers” seem always to have remained 
beyond his horizon. Even in the next century, after Euler 
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had dealt with those numbers for many years in connection 
with problems in analysis, it is only quite late in life that he 
introduced them into his number theory (cf. Chap.III, 
§XIV). When Fermat, writing to Mersenne on Christmas 
day 1640, told him that every prime p = 4n + | is in one 
and only one way a sum of two squares (Fe.11.213; cf. Fe.1.293, 
Obs.VII), he must of course have proceeded differently. 
Fortunately he gave us a faint indication of his method in 
his communication of 1659 to Huygens (Fe. 11.432; cf. infra, 
§X). There he says that he had used his “method of descent”, 
showing that if it were not so for some prime it would also 
not be true for some smaller prime “and so on until you 
reach 5”. This may not have seemed quite enlightening to 
Huygens; we are in a better position, because Euler, in the 
years between 1742 and 1747, constructed a proof precisely 
of that kind (cf. Chap.III, 8X); it is such that we may with 
some verisimilitude attribute its substance to Fermat. Actually 
it preserves, in a different arrangement, the various moments 
of the one we outlined above using the Gaussian ring. It is 
more or less as follows. 

Take a prime p = 4n + 1; we already know that it divides 
some suma* + 6° witha, b prime to p. Letrbe the remainder 
of the division of a by p; if r < 2n, put a’ = r; otherwise we 
have 0 <p — r < 2n, and we puta’ = p — r. Construct 8’ 
similarly from 8, and replace a, b by a’, b’. After this is done, 
p still divides a? + b°, and a, b are > 0 and < p/2, so that 
a’ + bis < p’/2 (this is the step which corresponds to the 
use of the euclidean algorithm for Z[7]). The same will hold 
if we divide a, 6 by their g.c.d., so that we may assume that 
they are mutually prime. Write now N = a? + 8°; all the 
prime divisors of N except p are then < p/2 and must be 
either 2 or of the form 4m + 1; we must show that, if all 
are sums of two squares, so is p. Here, as Euler found, one 
could try to proceed in two different manners. 

Firstly, if all the prime divisors of N/p are such, so is N/p, 
in virtue of the fundamental identity 


(1) (a? + b?)(x? +y?) = (axtby)? + (ay* bx)? 
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(cf. Chap.I, §VI). Therefore, writing N/p = x? + y’, we 


have 
a+b ax + by\? ay bx\* 
Zar ae 2,2} + |=! > 
y x+y x+y 


so that p is a sum of two rational squares. This brings us 
back to the question which Fermat had raised but apparently 
not solved in 1636 (cf. above, $V, and Chap.III, Appendix 
II). 

Following Euler (Corr.1.416~417,419), we shall now bring 
the proof to completion by a different route. For the sake 
of brevity, in this § and the next, a pair of integral squares 
with the sum AN will be called a “representation” of N. If 
two integers N, N’ have representations N = a? + 6°, 
N' = x? + y’, the right-hand side of the identity (1) defines 
two representations of NN’, which will be said to be derived 
from those of N and of N’ by “composition”. Now Euler's 
main step can be formulated as follows: 


Lemma 2. For any N = a® + 6°, letq = x° + y? bea prime 
divisor of N. Then N/q has a representation u* + v? such that 
the representation N = a? + 6° is one of those derived by composition 
from it and from q = x? + y?. 

Apply (1) to N and g. We have 
(2) Nq = (ax by)? + (ay*bx)’. 

But g divides the number 
Ny* — b*q = a’y? — b’x? = (ay—bx)(ay + bx) 


and therefore one of the numbers ay + bx. Consequently, 
if the sign is suitably chosen in (2), g divides the left-hand 
side and the second term in the right-hand side, hence also 
the first term. With the same choice of sign, we can now 
write 


(3) ax + by = qu, ay + bx = qu; 
dividing (2) by q’, we get N/q = u® + v®. Now (3), solved 
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for a and b, gives 
(4) a = ux + vy, = x(uy—vx). 


This proves the lemma. 

If now, as before, p is the greatest prime divisor of N = 
a® + b, with a, b mutually prime, and all prime divisors of 
N except p are known to be of the form x? + y, we can 
apply lemma 2 to any one of these, say q, then to any prime 
divisor other than p of N/q, etc., until we obtain for p itself 
an expression as a sum of two squares. This completes the 
proof of Fermat’s statement, as obtained by Euler more 
than a century later. Except for minor details, Fermat’s 
proof could have been much the same. 


SIX. 


As we have seen, the question whether a number N can 
be written as a sum of two squares, either in fractions or in 
integers, originates from Diophantus (cf. Chap.I, §§XI— 
XII); so does the question whether a number N can be the 
hypotenuse of a right-angled triangle “in numbers”, i.e., 
whether N? is a sum of two integral squares; so does also 
the question whether a number can be a sum of two squares 
in more than one way (cf. Chap.I, §V1). 

Perhaps Fermat's first mention of such matters is in his 
marginal note on Dioph.V.12, (Fe.1.313, Obs.XXV), which 
is merely to the effect that 3n cannot be a sum of two squares 
unless 3 divides n; this just amounts to saying that — | is 
not a quadratic residue modulo 3. It is immediately followed 
(Fe.1.314, Obs.XXVI) by a statement of “the true criterion” 
for N = 2m + 1 to be a sum of two squares: the necessary 
and sufficient condition is that m should be even, i.e., N of 
the form 4n + 1, and that, after N has been divided by its 
greatest square divisor, the only prime factors left in the 
quotient should themselves be of the form 4n + 1. This is 
contained in the results discussed above in §VIII. 

Already in 1640, however, when he takes up these ques- 
tions in his correspondence (Fe.I1.213—214; cf. Fe.11.221- 
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222 and Fe.1.293-297, Obs.VII), Fermat does not stop at 
the question whether N = x” + y” has a solution; he asks 
for the number of those solutions, and for a way to find 
them. Partly, no doubt, for traditional reasons, he lays special 
stress on the corresponding questions for “hypotenuses”, 
1.e., on the case when N is a square. 

In counting the number of “representations” N = x* + y? 
of an integer N, we will pay no attention to the signs of 
x and y, nor to their order; this agrees in substance with 
Fermat’s usage; of course he would always have taken x, y 
to be natural integers. A representation N = x° + y? will 
be called “proper” if x and y are mutually prime. If a rep- 
resentation N = x” + y° is not proper, ie., if x, y have a 
g.c.d.d > 1, one can write x = dx’, y = dy',N = d°N', and 
(x’, y') is a proper representation of N’. Thus all questions 
about representations can be reduced, if one wishes, to similar 
questions about proper representations; this is implicitly 
understood in Fermat’s treatment. In particular, if N has 
a proper representation, its odd prime factors must be all 
of the form 4n + 1; at the same time, N cannot bea multiple 
of 4, since x” + y* cannot be a multiple of 4 unless x, y are 
even. If now N = 2N’, and N’ is odd, then every repre- 
sentation N’ = x’? + y of N’ gives a representation N = 
x? + y? of N with x = x’ + y’, y = x’ — y’; conversely, if 
N =x? + y*, x and y must both be odd and we can put 

"= (xt+y)/2,y' = (x—y)/2,N’ = x? + y (cf. (4), Chap.1, 
§VI). These formulas determine a one-to-one correspond- 
ence between the representations of N and those of N’; if 
one is proper, so is the other. Thus it is enough to discuss 
representations for odd integers. 

The first point on which Fermat rightly insists is that 
every prime of the form 4n + | has a unique representation. 
Actually this follows at once from the result we have for- 
mulated as lemma 2 in §VIII; taking N = qin that lemma, 
we get N/q = 1 = u? + v’, so that (u, v) is (+1, 0) or 
(0, +1); (4) shows then that (a, 6) defines, in the sense ex- 
plained above, the same representation of ¢ as (x, y); obviously 
it is a proper one. 
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From the same lemma it follows now that any represen- 
tation of an integer N can be obtained by “composition” 
from a factoring of N into its prime factors. If, following 
Fermat, we call “primitive” the representations of primes 
by sums of two squares, one may say that all representations 
can be derived from the primitive ones by a systematic pro- 
cess, viz., by composition. This is indeed specified by Fermat 
for the case of “hypotenuses”, i.e., for representations of 
squares (Fe.11.222); undoubtedly he was aware of it also in 
the general case. As he could hardly have arrived at such 
aconclusion by mere trial and error, this gives an additional 
argument for assuming that he was in possession of some 
such result as lemma 2. 

As to the number of representations of an integer, Fermat 
has the following statements (Fe.11.213—215; cf.Fe.1.293- 
297, Obs. VII): 


(I) If p is a prime of the form 4n + 1, and r = 1, then 
N = p’ has exactly r/2 representations if r is even and 
(r+ 1)/2 if r is odd. 
(II) If m ts any integer, take all the distinct primes of the 
form 4n + | dividing m; let a, , ..., -y be their exponents 
in m; then m is exactly h times a hypotenuse, i.e., m?” 
has exactly h representations, with h given by 
2h + 1 = (2a+1)(2B84+1)... Qy+1). 

The letter to Mersenne containing (I) and (II) is dated 
25 December 1640. Only a few months earlier (Fe. I1.202- 
204), he had only been able to tell Roberval that a prime 
p = 4n — | cannot divide the sum of two mutually prime 
squares (i.e., that — I is a non-residue modulo p = 4n — 1), 
adding that “nothing he had ever found about numbers 
had pleased him as much as the proof of that result”. What 
a giant stride forward in less than half a year! 

In making the statements quoted above, Fermat does not 
specify that he has proofs for them; nevertheless one may 
plausibly try to reconstruct his reasoning. As we have said, 
it is enough to deal with proper representations. If we go 
back to lemma 2 and its proof, we see from formula (4) 
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that, if (a, 6) is a proper representation for N, (u, v) must 
be one for N/g. This has a converse, which we also formulate 
as a lemma: 


Lemma 3. Let q = x° + 9° be an odd prime divisor of N; assume 
that N/q is >2 and has the proper representation N/q = u? + v°. 
Let (a, b) and (a', b') be the two representations for N, arising 
from (u, v) and (x, y) by composition. Then these two representations 
of N are distinct; they are both proper if q does not divide N/q; if 
q divides N/q, one is proper and the other is not. 


In fact, we have 
a= ux + vy, b = uy — ux; a’ = ux — vy, b' = uy + vx, 
Clearly a # +a’, and itis easily seen thata # +b’. We have 
ax + by = uq, ay — bx = vq, 


and similar formulas for a’, b’; as (u, v) is proper, this shows 
that no prime, other than q, can divide a and 3, or a' and 
b'. If q divided both a and a’, it would divide a + a’, 
a ~— a’, hence u and v; as this is not so, either (a, 5) or 
(a’, b’) must be proper. If ¢ divides, say, a and b, q’ divides 
N = a’ + 6°, and q divides N/q; therefore, if it does not, 
both (a, b) and (a’, b’) are proper. Finally, if q divides N/q, 
we write, just as in the proof for lemma 2 


bb’ _ uy? _ yx? _ 7 _ vq; 


this is a multiple of g, so that g divides b, hence also a (as 
shown by N = a? + 6°) or b’, hence also a’. This completes 
the proof. 

There cannot be much doubt that Fermat must have pro- 
ceeded more or less along such lines, even though he would 
of course not have expressed his ideas in this fashion and 
might well have contented himself with a careful analysis 
of typical numerical cases. Probably he might have begun 
with the case of a power P = p’ of a prime p = 4n + 1; 
applying lemma 3, or some similar result, to such a power, 
one finds at once that it has one and only one proper rep- 
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resentation; this, in substance, is equivalent to Fermat’s 
statement (I). If now N is a product of p powers P,, Po, ..., 
P, of distinct primes, the lemma shows that it has 2°~' proper 
representations, arising by composition from those of the 
factors P;, since each additional factor multiplies the number 
of proper representations by 2; (II) follows easily from this 
fact. One could also obtain a formula for the number of 
representations, proper or not, of an arbitrary integer, but 
this would not be quite as simple as (II). 

One consequence of the above results must be pointed 
out here, because of the role it must have played in some 
of Fermat’s later work on diophantine equations (cf. infra., 
§XVI); we shall formulate it as a lemma. 


Lemma 4. Let N° be a cube with a proper representation N 3 = 
a’ + b°. Then N has a representation u® + v* such that a = 
u> — 3uv*, b = v® — 30u?. 


As N* has a proper representation, N must be odd and 
must be a product of powers P,, ..., P, of p distinct prime 
factors of the form 4n + 1; then so is N°, and both have 
2°! proper representations, all arising by composition from 
those of the factors P; resp. P?. Consequently, each proper 
representation for N* must arise by composition from one, 
and only one, of N; in particular, N must have a represen- 
tation (u, v) such that, when it is first composed with itself 
and then with the resulting representation of N?, one gets 
the given one (a, b) for N*. If the signs for u, v are chosen 
properly, one obtains the formulas in the lemma. One would 
get the same result more easily by making use of the ring 
Z[z] of Gaussian integers (cf. Appendix I). There is of course 
a similar result for any power N’ with r = 2, and it can be 
proved in the same way. The case r = 2 is nothing else than 
Euclid’s lemma on pythagorean triangles (cf. Chap.I, §V) 
and does not require the elaborate machinery needed for 
lemma 4. 

Thus the theory of the representations of numbers by 
sums of squares may be regarded as essentially complete, 
so far as “composite” (i.e., non-prime) integers are concerned. 
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There remains the problem of actually finding what Fermat 
calls the primitive representations, i.e., those of the primes 
p = 4n + 1. In 1654 he told Pascal that he had found a 
systematic method for this (“une régle générale”: Fe.11.313; 
cf. infra, $X1II). In 1641, however, he knew of no such 
method, and had nothing better to offer than trial and 
error (“tdtonner”: Fe.11.223). 

It is noteworthy that Fermat, “writing hurriedly” (as he 
says) on Christmas day 1640, still finds time to point out 
the analogy between numbers of representations by the 
form X? + Y?, and similar numbers for X? — Y?, or rather, 
as he formulates it, for the number of times that N can be 
written as x —~ y with xy a square. This condition gives 
x = du’, y= dv*, N = d(u?—v’), where d is the g.c.d. of x 
and y, so that u, v are mutually prime. For N odd, the number 
of solutions is indeed given by the same formula as in (ID), 
except that all the prime divisors of N are to be taken into 
account, and not merely those of the form 4n + 1. The 
same question is mentioned on another occasion (Fe. 11.256— 
258, of doubtful date), in connection with a method for 
factoring large integers. Take for N a large odd integer; 
let n be the largest integer <VN. The traditional way of 
factoring N is to try to divide it successively by all primes 
from 3 up to n. Instead of this, Fermat tries to write it as 
N=x - ys as x must be >n, this can be done by writing 
the sequence 


(n+ 1)? — N, (n+2)? — N, (n+3)? —N,... 


until one finds a square in it. As successive differences of 
consecutive numbers in that sequence are 2n + 3,2n + 5, 
etc., the calculation can be carried out rather easily, as shown 
by Fermat’s example 


N = 2027651281, n = 45029, 
where he finds 
N = 450412 — 10207 = 46061 x 44021. 


The procedure is quite effective if N has divisors close to 
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VN. As a means of testing whether a given large number 
is or is not a prime, Fermat, writing to Huygens in 1659, 
had to acknowledge that it is far from perfect (Fe.I1.435), 
in spite of many shortcuts which he says he has discovered. 
Of course some progress has been made in this since Fermat; 
but it is diverting to observe that supposedly “unbreakable” 
codes are being constructed nowadays on the assumption 
that the factoring of very large numbers is beyond the power 
of our most sophisticated computers. 


§X. 


Inthe tantalizingly brief account of his number-theoretical 
work sent to Huygens ved Carcavi in 1659 (Fe.11.431—436 = 
Huy.11.458—462) Fermat has the following to say about his 
early work: 

“As ordinary methods, such as are found in the books, 
are inadequate to proving such difficult propositions, I dis- 
covered at last a most singular method ... which I called 
the infinite descent. At first I used it only to prove negative 
assertions, such as: “No number of the form 3n — 1 can 
be written as x° + 3y? ”, “There is no right-angled triangle 
in numbers whose area is a square”... . To apply it to af- 
firmative questions is much harder, so that, when I had to 
prove that “Every prime of the form 4m + | is a sum of 
two squares”, I found myself in a sorry plight (“en belle 
peme’’). But at last such questions proved amenable to my 
method...” 

Was it pure negligence that Fermat, in introducing his 
infinite descent to Huygens, included the first statement 
about numbers 3n — 1? Are we faced there with a copying 
error? Our text is based, not on an autograph of Fermat, 
but on a copy made by Huygens. By the time it was written, 
Fermat had long known (and surely Huygens knew very 
well) that the statement in question can be proved by a 
trivial congruence argument modulo 3. There had been a 
time when Fermat had not been aware of this (cf. above, 
§V); should one imagine that the first proof he had worked 
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out for that statement (and perhaps for the companion 
statement about sums of two and four squares; cf. §V) had 
been one by infinite descent, before he realized that a con- 
gruence argument would suffice? However that may be, 
the passage quoted above indicates that his proof for the 
second statement must have preceded his work of 1640 
about sums of two squares (cf. above, §1X). This is in part 
confirmed by his letter to Mersenne (Fe.I1.65, presumably 
to be dated 1638; cf. above, §11), where he challenges Sainte- 
Croix to find a right-angled triangle whose area is a square, 
a cube equal to the sum of two cubes, etc.; clearly he must 
already have known, or at least suspected, that these are 
problems without solutions. In 1640 he sends the same (and 
another impossible one) to Frenicle through Mersenne in 
order to test Frenicle’s theoretical knowledge in such matters; 
“if he says that there is no solution up to this or that limit”, 
Fermat tells Mersenne, “be assured that he proceeds by 
tables”. There is no sign that Frenicle gave any answer. 

In his Traité des trangles rectangles en nombres, posthumously 
published in 1676 (and reprinted in 1729, Mém.Acad.Sc.t.V, 
pp. 127-206), Frenicle did include a proof that the area of 
a pythagorean triangle can neither be a square nor twice a 
square. In view of Fermat’s statement to Huygens (Fe.1I.436), 
one may safely assume that this was based on a commu- 
nication from Fermat; the lack of an acknowledgment, in 
a manuscript which the author did not see through the 
press, does not imply that Frenicle was claiming credit for 
those proofs. In that same Traié, Frenicle gives “induction”, 
i.e., eXperimentation, as his only reason for assuming that 
every prime of the form 4n + 1 is a sum of two squares 
(cf. infra, §XI1(ii)); all we know about him indicates that he 
was an experimentalist, in contrast with Fermat whose fore- 
most interests were theoretical. 

Writing to Huygens about his proof by descent for the 
theorem on the area of pythagorean triangles, Fermat merely 
described the method by saying that “if the area of such a 
triangle were a square, then there would also be a smaller 
one with the same property, and so on, which is impossible”, 
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he adds that “to explain why would make his discourse too 
long, as the whole mystery of his method lies there” 
(Fe.11.432). Fortunately, just for once, he had found room 
for this mystery in the margin of the very last proposition 
of Diophantus (Fe.1.340—341, Obs.XLV); this is how it goes. 

Take a pythagorean triangle whose sides may be assumed 
mutually prime; then they can be written as (2pq, p°—4q’, 
p’+q°) where p, g are mutually prime, p > q, and p — q is 
odd (cf. Chap.I, §V). Its area is pq(p+q)(p—q), where each 
factor is prime to the other three; if this is a square, all the 
factors must be squares. Write p = x°,q = y°,p tq = wv’, 
p — q = v’, where u, v must be odd and mutually prime. 
Then x,y and z = uvareasolution of x* — y* = 2°; incidentally, 
v®,x*, u* are then three squares in an arithmetic progression 
whose difference is y* (cf. Chap.1, §VI, and Fe.11.65, problem 
4°). We have u® = v” + 2y?; writing this as 2y? = (u+v)(u—v), 
and observing that the g.c.d. of u + v and u — vis 2, we 
see that one of them must be of the form 2r° and the other 
of the form 4s”, so that we can write wu = 7? + Qs°, 
+vu = 1° — 25°, y = 2rs, and consequently 

x? = Hu? +0") = 1r* + 454. 
Thus r*, 2s* and x are the sides of a pythagorean triangle 
whose area is (7s)” and whose hypotenuse is smaller than 
the hypotenuse x* + y* of the original triangle. This completes 
the proof “by descent”. 

Frenicle follows this proof faithfully, with little more than 
verbal changes (loc.cit. pp.173—175). As to triangles whose 
area is twice a square, he proceeds as follows (ibid., pp. 175— 
176). As above, one can write p + q = u’, p — q = v*, and 
either p = x°, g = 2y’, or p = 2x”, q = 9°. As u, v are odd, 
and 2p = u® + v*, p must be odd, and so we can write 
p = x°,q = 2y*. Then 4y? = (u+v)(u—v); as the g.c.d. of 
u + vand u — vis 2, we can write u + v = 2r°,u — v = 28°, 
u=r+s*,v =7° — s*, and finally 

x = ur tv*) = 71 + st 
Then the triangle (r?, s°, x) has the area 2(rs/2), and the 
proof is complete. 
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Fermat's descent (Chap., §X), from pp. 338-339 of the Diophantus 
of 1670. 
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From this, as Frenicle observes, and as Fermat must have 
known, a number of consequences follow. For instance, the 
fact that the equations x* + y* = z® have no non-trivial 
solutions is included in the above proofs (cf. Fe.I.327, 
Obs.XXXITI). Also, in a pythagorean triangle (a, b, c), a 
and 6 cannot both be squares, since then the area ab/2 would 
be twice a square; a and c cannot both be squares, since 
a=r°,c = s° would give s* — r* = 6’. From this one can 
deduce Fermat’s statement (Fe.1.341; cf. Fe.11.406) that no 
triangular number other than | is a fourth power. Triangular 
numbers are those of the form n(n + 1); if this is a fourth 
power, one of the integers n, n + 1 must be of the form x* 
and the other of the form 2y*, so that we have x* — 2y" = 
+1. Ifx > 1, consider the pythagorean triangle a = x", 
b = 4(x*—1), ¢ = 4x*+1); in one case a and b would be 
squares, and in the other case a and c would be such, con- 
tradicting what has been proved before. Similarly, by con- 
sidering the triangle a = x°z?, b = 3(x'-2'),¢ = 3x¢+2'), 
one sees that the equations x’ + z* = 2¢° have no non-trivial 
solutions; such, more or less, is the proof that Euler gave 
for this result in 1738 (Eu.1-2.47—49 in E98). 


SX. 


Still following the rough outline supplied by Fermat’s 
communication of 1659 to Huygens, we now turn back to 
quadratic residues and quadratic forms; it will appear later 
(cf. infra, §X VI) that the practice of the “method of descent” 
in the study of diophantine equations provided Fermat with 
a powerful incentive for proceeding in that direction. 

To summarize at once what will be discussed presently, 
Fermat’s results on that subject come under the following 
headings: 

(i) The statement about every integer being a sum “of 
three triangular numbers, four squares, five pentagonal 
numbers, etc.”; this is asserted as early as 1638 and frequently 
reiterated (cf. above, §V). As to triangular numbers, it 
amounts to saying that every integer of the form 8n + 3 is 
a sum of (at most) three squares. Other statements about 
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sums of three squares appear in the letter of 1658 to Digby 
(Fe.11.405) and in a marginal note to Diophantus (Fe.1.314— 
315, Obs.XXVII) which has all the appearances of belonging 
to an early period of Fermat’s career. Cf. infra, §XIV. 

(ii) As early as 1641, Fermat seems to have been aware 
of some of the main facts about the quadratic form X* — 
2Y? (cf. Fe.11.221,224—226); so apparently was Frenicle (cf. 
Fe.11.231-241), who specified, however, in his Traité des 
triangles (loc.cit.) that his results had been obtained “by in- 
duction”, i.e., experimentally. Frenicle’s main motivation 
in such matters came from the study of pythagorean tri- 
angles; his interest in the form X* — 2Y° arose from the 
fact that the sum and the difference of the two minor sides 
of a so-called “primitive” triangle 


(a, b, c) = (2pq, p -q, p +q°) 
(i.e., one for which a, b, c are mutually prime; cf. Chap.I, 
§V) are given by that form: 


|o+al = |p°—q° + 2pq| = \(p+q)°— 24". 

As to Fermat, one gets the impression that very soon it was 
the quadratic form as such that attracted his attention. 

(ili) Fermat’s extant correspondence with Frenicle, in- 
cluding the communications which passed between them 
through Mersenne, is entirely concentrated within the years 
1640 and 1641. From then on, and until 1657, Frenicle’s 
name occurs but rarely in Fermat's letters. Indeed, as a 
glance at volume II of his Zuvres will show, his correspond- 
ence between 1642 and 1654 has little to offer in the way 
of mathematical content. It is only in 1654 that we learn 
for the first time of the progress Fermat has made with his 
number theory in all these years; after an exchange of letters 
with Pascal on probabilities (of great importance, but of no 
concern to us here), Fermat musters enough self-confidence 
to send to Pascal some of his discoveries on “numbers” 
(Fe.11.312—313). What is new there is that, for the first time, 
the quadratic forms X? + 3Y*, X° + 2Y* appear, as well as 
the mention of a “general rule” for finding the two squares 
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in the representation p = a” + 6° of any prime of the form 
4n + 1. We find the form X? + XY +Y? mentioned in the 
notes on Diophantus (Fe.I.300—301, Obs.X—XII), but this 
is of course not to be separated from X? + 3Y?, as shown 
by the identities 


A(x + xy+y*) = (2x49)? + 3y’, 
x? + By? = (xFy)? + Qy(xFy) + (2y)*. 


What Fermat says he has proved concerning these forms 
is as follows: every prime of the form 3n + I can be written 
asx? + 3y? (Fe.11.313,403; cf. Fe.1.301, Obs. XII); every prime 
of the form 8n + 1 or 8” + 3 can be written as x° + 2y° 
(Fe.11.313,403); every prime of the form 8n + 1 can be 
written in infinitely many ways as |x?— 2y| (JEH.41). The 
latter fact had been known (experimentally, cf. above, (ii)) 
to Frenicle; it is stated in his letter of 1641 to Fermat 
(Fe.11.235). 

(iv) In 1654, Fermat had signally failed to arouse in Pascal 
an interest for number theory. In 1656 the English adven- 
turer and double agent Sir Kenelm DIGBY (“an arrant 
mountebank”, as Evelyn dubbed him; “the very Pliny of 
our age for lying”, according to another of his contempor- 
aries) visited Toulouse; obviously he must have met Fermat; 
either on that occasion or later, Fermat received from him 
Wallis’s Arithmetica Infinitorum, which had been printed in 
1656 (cf. Fe.11.337, and above, §III). This episode revived 
in Fermat the hope of finding, at least across the seas, a 
sparring partner worthy of himself; his correspondence 
with Digby, and, through Digby, with the English mathe- 
maticians WALLIS and BROUNCKER occupies the next 
year and a half, from January 1657 to June 1658. It begins 
with a challenge to Wallis and Brouncker, but at the same 
time also to Frenicle, Schooten “and all others in Europe” 
to solve a few problems, with special emphasis upon what 
later became known (through a mistake of Euler’s) as “Pell’s 
equation”. What would have been Fermat’s astonishment 
if some missionary, just back from India, had told him that 
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his problem had been successfully tackled there by native 
mathematicians almost six centuries earlier (cf. Chap.I, §TX)! 

Various diophantine equations, both in integers and in 
fractions, are also included in that correspondence; some 
have already been considered above, e.g., the impossibility 
of a pythagorean triangle of square area; of others more 
anon. Here, where we are concerned with quadratic forms, 
we merely take note of a conjecture about the form 
X? + 5Y?; Fermat had observed experimentally that, while 
primes of the form 20n + 3, 20n + 7 cannot be written 
as x° + 5y”, any product of two such primes can be so written; 
he mentions (Fe.1I1.405) that he is convinced of this, but 
cannot prove it. As has been known since Lagrange (cf. 
Lag J11.775—6) this is just where the question of the class- 
number enters the picture. No wonder Fermat was puzzled! 

(v) Finally, in his communication of 1659 to Huygens, 
Fermat, while repeating some of the above statements, refers 
to his work on Diophantus’s “simple and double equations”. 
Most of these lead to curves of genus 1 (cf. infra, §XV); but 
Fermat gives two examples which belong to the theory of 
quadratic forms. Take for instance, he says, the equation 
2x? + 7967 = y; “I have a general rule for solving this if 
possible, or else for finding that it is impossible, and I can 
do it, whatever the coefficients may be”; to this he adds a 
further example’, the “double equation” 2x + 3 = y’, 3x+ 
5 = 2, which amounts to 2z* — 3)? = 1. The reference to 
Diophantus indicates that here Fermat asks for solutions 
“in fractions”, i.e., for rational solutions. In other words, 
he refers to the general problem, successfully treated by 
Lagrange in 1768, of solving any equation x* = Ay* + Bz 
in integers. 


? In Fe.11.435, line 2, 2N + 5 isa misprint for 3N + 5 (cf. Fe.1V.140, 
line 2, and Huy.11.461, line 10). As to the first equation, Fe.1V.139, line 
2 from b., has 1967 instead of 7967; it matters very little which one is 
the correct reading; note that 7967 = 31 x 257 and 1967 = 7 x 281. 
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§XII. 


Now we shall take up the topics listed in §X1; we begin 
with (iii) and (ii). 

In Chap.I, §VIII, we have described the two formulas 
(5) and (6) as being of ancient origin; here it will be enough 
to write both as 


(5) (x? + Ay’)(z?+ Al) = (xz Ayt)? + A(xt¥yz)’, 


where A may be either a positive or a negative integer; the 
times when the transition from one case to the other could 
have caused some difficulty were long past. This point of 
view was familiar to Fermat and not a few of his contem- 
poraries; they may have regarded (5) as a rather obvious 
generalization of the special case A = 1. 

In §§VII, VIII and IX, we attempted, with Euler’s help, 
to reconstruct Fermat’s procedure concerning sums of two 
squares by means of lemmas 2 (§VIII) and 3 (§1X). Now, 
still following Euler, we note that lemma 2 subsists without 
any change, with its proof, if we replace the form X* + Y? 
by X® + AY®. Of course composition has now to be understood 
as based on the identity (5). For (3) and (4), one has then 
to substitute 


ax + Aby = qu, ay ¥ bx = qu, 
a=ux+ Avy, 6b = +(uy—vx). 


From this extension of lemma 2 one concludes, just as in 
§1X, that, if A > 1, a prime q cannot be written as x” + Ay” 
in more than one way; if it could, our result, applied to q 
and to N = q, would give 1 = u® + Av*. On the other hand, 
this same remark leads at once to Pell’s equation if A is 
negative; this will be discussed more in detail in §XIII. 

Again, just as in § VIII, we conclude from the lemma that, 
if a prime p divides a number N = a? + Ab”, and if every 
one of the other prime divisors of N can be written as 
x” + Ay’, then the same is true of p. We now reconsider in 
this light the proof given in §VIII for the form X* + Y?. 
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Let us for a moment adopt Euler’s terminology, calling an 
odd prime p a prime divisor of the “formula” (or “form’”) 
X? + AY? if it divides some integer N = a® + Ab? with a 
and 6 prime to p; this is so if and only if —A is a quadratic 
residue modulo p (cf. §VII). Let p be such; take a, b prime 
to pand such that p divides a” + Ab’. Let r be the remainder 
of the division of a by p, and put a’ = ror p — 7, whichever 
is smaller; 6’ being defined similarly from }, we can replace 
a, 6 by a’, b’, and then divide a’, b’ by their g.c.d.; in other 
words, we may assume that a and 6 are mutually prime and 
< p/2. PutN = a + Ab’. If A = 2, we have N < 3p7/4, so 
that all prime divisors of N, other than p, are <3p/4; then, 
just as in 8VIII, we may, using “complete induction”, assume 
that they can all be written in the form x* + 2y* (note that 
this includes the prime 2 if N happens to be even), and, 
applying lemma 2, we conclude that also p can be so written. 
Take A = 3; then we get N < p’; thus the same reasoning 
is valid if N is odd. If N is even, a and b must be odd, and 
we have a = + b (mod 4) if the sign is properly chosen. 
Then a’ = }(a + 36) and b' = 3(a ¥ b) are integers, and so 
is N/4 = a’ + 36"; we can then replace N by N/4, and so 
on if necessary, until we have replaced N by an odd integer 
with the same properties. Thus every prime divisor of X” 
+ 3Y* can be written as x” + 3y°. We can proceed similarly 
for A = —2 since in that case 0 < a, b < p/2 implies |N| < 
p’/2. As observed above, the representation p = 
x” + Ay® must be unique if A = 2 or 3, but of course not if 
A = —2, as appears from the identity (5) combined with 
1=3?-2x 2 Incidentally, as Euler was to observe, the 
identity (5) shows that, for A = 1, +2, 3, all odd divisors 
(and not only all odd prime divisors) of any integer N = 
a’ + Ab’ with a prime to 6 can be written as x? + Ay’. 
Obviously every integer x” + 3y°, if not a multiple of 3, 
must be of the form 3n + 1; therefore the above results 
imply that every prime divisor of X* + 3Y?,i.e., every odd 
prime for which — 3 is a quadratic residue, must be of that 
form; to complete the proof of Fermat’s statement of 1654 
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about such primes, one has to prove the converse. This is 
the same kind of difficulty which was pointed out in §VII 
concerning the form X* + Y*; following Euler again, we 
write p = 3n + | and use the identity 


x" — 1 = (x"—1)(x"+x"4+ 1). 


For all x prime to p, p divides the left-hand side; in view of 
lemma I, §VII, we conclude that, for some x prime to p, p 
must divide the second factor in the right-hand side, and 
therefore also 


A(x?" +x" +1) = (2x"+1)? + 3. 


This shows that p is indeed a divisor of X* + 3Y?. 
As to the forms X* + 2Y’, take first p = 8n + 1 and 
write 


x — 1 = (x= 1)[(0"F 1)? + 2(x")*]. 


As above, using lemma 1, we conclude that p is a divisor of 
X? + 2¥? and of X*? — 2Y°. Now let p be of the form 8n — 1 
or 8n + 3. If p = 8n — 1, it cannot be a divisor of X* + 2Y°’, 
for we have seen that such a divisor would itself be of 
the form x” + 2y? and therefore of the form 8n + 1 or 
8n + 3. For a similar reason, if p = 8n + 3, it cannot be 
a divisor of X? — 2Y*. Now in both cases p is not a divisor 
of X? + Y?, as has been shown in §VII; therefore, if we put 
p = 2m + 1, no two of the 2m integers 


2 92 2 2 2 2 
]*, OF. te — 1 oan — OR 


can be congruent modulo p, so that they exhaust the 2m 
congruence classes prime to p modulo p. This shows that, 
for every a prime to p, one of the integers a, —a is a quadratic 
residue and the other is not. Take p = 8n +3; as p is not 
a divisor of X? — 2Y?, 2 is not a quadratic residue modulo 
p; therefore —2 is such a residue, which is the same as to 
say that pis a divisor of X? + 2Y”; then it can itself be written 
as x” + 2y*. Similarly, if p = 8n — 1, it is a divisor of 
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X* — 2Y* and can be written as x” — 2y*. This completes 
the proof of Fermat’s statements about the forms in question. 

Concerning the number of representations of integers 
by the forms X* + 2Y?, X® + 3Y°, Fermat must have known 
that whatever method he had followed for sums of two 
squares could be applied with almost no change; lemma 3 
of §1X remains valid for these forms, as well as its proof, 
provided one adds the assumption that q > 3, N/q > 3 if 
the form is X* + 3Y*. Just as in §1X, one can conclude from 
this that all representations of an integer N can be derived 
by composition (in the sense of identity (5)) from those of 
the primes dividing N. In particular, every proper repre- 
sentation of an integer N’ can be derived by composition 
from one of N. For r = 2, this can be verified elementarily, 
just as in the case of sums of two squares (cf. above, §IX); 
in fact, the case r = 2, for the form X° + 2Y?, has already 
played a role in Fermat’s proof described in §X. At least 
the case r = 3 must also have been known to Fermat, since 
it is needed in some of the proofs to be discussed below in 
§XVI; we shall formulate it here as a lemma: 


Lemma 5. Assume that N° = a? + Ab’, with a, b mutually 
prime, and A = 2 or 3. Then one has N = u? + Av’, with u, v 
such thata = u? — 3Auv", b = 3u7v — Av’. 


The proof is similar to that of lemma 4 in §1X; of course 
it has now to be based on (5). Here, too, one would obtain 
an easier proof by making use of the ring Z[V— 2] resp. 
Z[j] (cf. Appendix I). 

The case of the form X* — 2Y* is more complicated, since 
here the solutions of x° — 2y° = +1 have to be taken into 
account (cf. Chap.], §§VIII and IX). Some essential facts 
about it were known to Fermat already in 1641 (cf. loc.ctt. 
§X1(i)), but his main statements are contained in his com- 
munication to Frenicle of which the latter got a few copies 
printed and added to his own pamphlet of 1657 (JEH.41- 
44). His motivation at that time was the study of the dio- 
phantine equation 


(6) (2X21)? = ey? - 1. 
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Fermat told Huygens in 1659 that this has no other solution 
in integers than (1, 1) and (2, 5) and that he has sent a proof 
of this to Frenicle (Fe.11.434 = Huy.11.461, and again 
Fe.11.441 = Huwuy.11.538). Genocchi proved it rather easily 
in 1883 (Nouv.Ann.Math.{II1) 2, 306-310), using the fact 
(proved by Euler, and possibly known to Fermat; cf. above, 
§X) that the equations X* + Y* = 2Z? have no solution. 
For Fermat, equation (6) turned up in connection with a 
problem included in his challenge of 1657 “to the English 
mathematicians and all others”, viz., to find a cube which, 
added to the sum of its divisors, makes up a square. As an 
example, he mentioned 7°, which gives 1 + 7 + 7? + 7° 
= 20° (Fe.11.332), but soon he added that he knew of other 
solutions (Fe.1I.342). 

As was also noticed by Frenicle, if the cube is p*, with p 
a prime, Fermat’s condition can be written 


l+pt+p? + p = (pt+l\(p?+1) = a square. 


As 2 is not a solution, p + 1 and p + | have the g.c.d. 2 
and must be of the form 2x’, 2y’, where (x, y) is a solution 
of (6). Frenicle could not show that there is no such prime, 
but found out somehow that there is none between 7 and 
10” (cf. Huy.11.25 and JEH.23—24). Thus, in dealing with 
(6), Fermat’s attention was drawn more particularly to the 
case where p = 2x* — Lisa prime. This is how he proceeds. 

From every representation n = x? — 2y° of a positive or 
negative integer n by the form X? — 2Y*, one deduces “by 
composition” the representations 


—n = (x+2y)? — Q(a+y)? 


of —n; this is no more than the case A = —2,z =¢= lof 
(5) (cf. Chap.I, §VIII). Thus, in dealing with that form, it 
is enough to consider only representations either of positive 
or of negative integers. In part because of the application 
to (6), Fermat restricts himself to the latter, 1.e., to repre- 
sentations n = 2y° — x°® of positive integers n, where of 
course he takes x, y to be positive integers, and we shall do 
the same; then 0 <x <y V2. Also by composition, i.e., by 
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applying (5) to the case A = —2,z = 3,t = 2, we see that 
either one of the substitutions 


(7) (x, y)—>([3x— 4y], 3y — 2x) 
(8) (x, 9) —>(3x + 4y, By + 2x) 


transforms any representation n = 2y? — x° of n into another 
one. Among these, as Fermat now observes, there is always 
one for which 0 = x < y; in fact, if x > y, (7) transforms 
(x, y) into (x’, y') with x’ <x, y’ < y, so that, by iterating (7) 
as often as may be needed, one comes to a representation 
(xo, Yo) with x9 < yo; such a representation is called “le plus 
petit couple” by Fermat; we shall call it minimal. Applying (7) 
to it again, one gets a representation (xo, yo) with xo = yo, 
xo 2 Xo, Yo = Yo; applying (8) repeatedly to (xo, yo) and to 
(xo, 9), One gets two sequences (x;, y;), (xj, yi) of represen- 
tations of one and the same integer n, all distinct from one 
another unless x9 = 0 or xo = yo. From this Fermat concludes 
rightly that every solution of n = 2y? — x* can be derived 
by this process from a minimal one; “the proof of all this 
is easy”, he writes, even though a critical reader may well 
feel that some further elaboration would be welcome. For 
the modern reader, the matter could best be clarified by 
working in the ring Z[V2] (cf. Appendix 1); to write 
n = 2y* ~ x? is the same as to write —n = N(é) = €é' with 
€=xt yV2, é’ =x -~ yV2; the substitution (8) amounts 
to € > «*é with e = 1 + V2, and (7) to € > & *é or to 
&é— —«°€', as the case may be; but it is hardly to be imagined 
that Fermat, even heuristically, would have proceeded in 
this manner, in spite of the precedent offered by Euclid in 
his tenth book (cf. Chap.I, §VIID). Anyway, the above treat- 
ment, applied to the case n = | and the “minimal repre- 
sentation” | = 2 x 1° — 1°, supplies the fact that all solutions 
of 1 = 2y* — x* can be derived from that one by the repeated 
application of (8), i.e. by repeated composition with the 
representation | = 3° — 2 x 2. Again, by composition 
with the minimal representation, this produces all solutions 
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of x? — 2y° = 1, thus showing that Theon’s “side and diagonal 
numbers” (cf. Chap.I, §VIII) give all the solutions of 
x? — QyP = +], 

Fermat’s further deductions must have followed the pat- 
tern set by his theory of the form X? + Y®, even though it 
cannot be said that this pattern emerges clearly from his 
communication to Frenicle. Just as he had proved that a 
prime p = 4n + 1 has one and only one representation as 
a sum of two squares (cf. above, §§VIII and IX), he must 
have found, using similar arguments, that any two repre- 
sentations of a prime by the form X? — 2Y? must be such 
that they can be derived from each other by composition 
with a representation of +1 (cf. the proof of lemma 2 in 
§VIII, and the above remarks about its extension to forms 
X? + AY’); in view of what has been shown above, it amounts 
to the same to say that a prime p can have only one minimal 
representation p = 2b° — a® with 0 < a < 6. Then, by 
composition, one derives from this a representation 
p = 2B — A®, with 


(9) A= \a® + 26° — 4ab}, B = a + 2b? — 2ab, 


of which it is easily verified that it is minimal. Conversely, 
if p is any odd prime, assume that p” has the representation 
p = 2B? — A®; this may be assumed to be minimal, so that 
0<A<B; we have then Pp = A” — 9B” with A’ = 2B + 6A, 
B' = B + 6A, 8 = +1; since A must be odd, 6 may be 
chosen so that A’ = p + 2 (mod 4). This gives 


2B’? = (A'—p)(A' +p), 


from which (just as in Fermat’s proof in §X) one can conclude 
that A’, B’, p can be written as 


il | 


A’ =a? + 2b’, B' =2ab, p=2b-a@ 


where a, } are positive integers; then A, B are given by (9). 
Moreover, the inequalities p > 0, A < Bif 6 = +1, and 
p> 0,A > 0if8 = —1, imply (when #, A, B are expressed 


Fermat and His Correspondents 91 


in terms of aand 6) a < b, so that p = 26? — a? is the minimal 
representation of p. 

Now assume, as in Fermat’s problem, that p = 2x7 — 1 
and p* = 2y? — 1; these are minimal representations of p 
and of p”, respectively. Applying to the latter what has been 
proved above, i.e. taking in it A = | and B = y, we find 
that A and B are given by (9), where p = 20° — a’ isa 
minimal representation of p. As we have just seen, however, 
a prime p can have only one such representation; therefore 
we have a = 1, 6 = x, and (9) gives now: 


1 = |1+2x°—4x 


> 


which is easily seen to have no other solutions than x = 1 
or 2, giving p = 1 (which is not a prime) and p = 7. This 
completes the proof, so far as primes are concerned. 

While Fermat is far from explicit on some of the points 
considered above, one may still plausibly argue that he could 
have filled in the missing links, had he cared to do so. Un- 
luckily his communication to Frenicle, brilliant as it is, does 
not end there. “What has been proved for primes”, he writes, 
“can be extended, as you know, to composite numbers; . . . 
I need not dwell at length on such easy matters”. Therefore, 
he says, equation (6) has indeed no other solutions than 
(1, 1) and (2, 5). 

His argument, however, fails for “composite” (i.e., non- 
prime) numbers, because such integers have in general more 
than one “minimal” representation, as he and Frenicle must 
have known for many years (cf. loc.cit. §X1(ii)). Apparently 
he had trapped himself by an imprudent use of the words 
“le plus petit couple”, translated above by “minimal”. Of course, 
for a given n, the smallest solution of n = 2y? — x? is “min- 
imal”, but the converse is not true. 

There is no sign that Frenicle detected the fallacy in Fer- 
mat’s final argument; nor did J. E. Hofmann when he pub- 
lished Fermat’s piece in 1944. Indeed one may, without 
unfairness, suspect that the proof went somewhat over his 
head. Nevertheless, on this occasion at least, it seems clear 
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that Fermat’s quickness of mind had tripped him. The rep- 
utation for infallibility that some of his admirers have sought 
to build up for him is no more than a legend. 


8X11. 


The study of the form X* — 2Y* must have convinced 
Fermat of the paramount importance of the equation 
x? — Ny? = +] to be solved in integers, or, in modern 
parlance, of the units of real quadratic fields. As we have 
seen in Chap.I, §§VIHI-IX, Archimedes and the Indians 
had trodden this path long before him; of course he could 
not have known about the Indians, nor about Archimedes’s 
“problema bovinum” (the epigram discussed in Chap.I, §1X). 
He had read Theon of Smyrna (Fe.11.266), and therefore 
knew about the “side and diagonal numbers” (cf. Chap.I, 
§VII1), but this did not go beyond the case x* — 2y? = +1. 

When Fermat, in 1657, offered the equation x” — Ny’ = 1 
as a Challenge problem to the English and all others, he 
was asking for solutions in integers; this had not been spec- 
ified in the challenge that went to England, but “it goes 
without saying”, as he wrote to Frenicle (Fe.11.334), since 
any novice (“le moindre arithméticien”) can obtain a solution 
in fractions. It is just such a solution in fractions, however, 
that Wallis and Brouncker sent him at first (Fe.111.418). 
When Fermat pointed out to them that this was not what 
he meant, did he expect to make them confess their ig- 
norance? Did he feel a touch of annoyance when, after only 
a few months’ delay, he received from them a complete 
method of solution? In his answer, sent as usual through 
Digby, he says that he “willingly and joyfully acknowl- 
edges” the validity of their solution (“lbens agnosco, imo et 
gaudeo ...”, Fe.11.402). Writing privately to Huygens the 
next year, however (Fe.I1.433), he points out that the English 
had failed to give “a general proof”. Huygens had already 
said as much to Wallis (Huy.II.211). A general proof, ac- 
cording to Fermat, can only be obtained by descent. 
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It may thus be surmised that Fermat’s method of solution 
did not greatly differ from the one he got from Wallis and 
Brouncker®, but that he had been able to extract from it a 
formal proof of the fact that it always leads to a solution. 
The method to be summarized now is the one which Wallis 
credits to Brouncker (Wal.II.797; for more details, cf. 
A. Weil, Coll.Papers I11.413—420); it is the most satisfactory 
variant among several that are described in that corre- 
spondence, and is equivalent (in substance) to the Indian 
cakravala method (Chap.I, §1X) as well as to the modern 
treatments based on continued fractions. 

The starting point here is to assume the existence of a 
solution (x,y) of x* — Ny* = I, and to transform the problem 
by successive steps into others, each of which has a solution 
in smaller numbers; as may be seen from Fermat’s treatment 
of the pythagorean triangles of square area (above, §X), 
this is typical of Fermat’s descent; the difference lies in the 
fact that in the latter case one seeks to prove that there is 
no solution, while in the former the purpose is to find one. 
At the same time, Bachet’s method for the solution of the 
equation ay — bx = +1 (cf. Chap.I, 8IV) undoubtedly pro- 
vided a model for Wallis, Brouncker and Fermat to follow, 
depending as it does on the following observation. For 
a > 6 > 0, assume that the equation aY — bX = +l hasa 
solution (x, y) in positive integers (both not 0, since for Bachet 
and his contemporaries 0 was no more “a number” than 1 
had been in the days of Euclid); of course a and 6 must be 
mutually prime. Write a = bm, + ¢ withO <c¢ <6, and put 
x = my + xz. Then we have bz — cy = ¥1, and (y, z) isa 
solution in positive integers, respectively smaller than x and 
y, of the equation bZ — cY = #1; this is similar to the 
original one, and incidentally has smaller coefficients. Re- 
peating the process, one must eventually reach an equation 


8 Indeed J. Ozanam, describing a solution, identical with Brouncker’s, 
on page 509 of his Nouveaux éléments d’algébre of 1702, and illustrating 
it by treating the cases N = 23 and N = 19, attributes it to Fermat. Did 
he know more than we do, or had he merely misread the Commercium 
Epistolicum? 
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rV — sU = + 1 withr > s > 0, admitting a solution (u, v) 
with u > v = 1; this implies that r = su + 1; pushing the 
process either one or two steps further, as a modern arith- 
metician would do, one reaches a point where the solution 
is (1, 0). Anyway, for the final equation, the solution is ob- 
vious, so that, running the process in reverse, one obtains 
a solution for the original problem. This is precisely the 
kuttaka method of the Indian mathematicians (cf. Chap.I, 
§§IV and IX); as pointed out in Chap.I, it does not differ 
substantially from Euclid’s algorithm for finding the g.c.d. 
of two integers, nor from the calculation of the continued 
fraction for a/b. The modern reader may also note that the 
first step in Bachet’s solution can be written 


(23) = (65)-(20) (24). 


so that the whole process amounts to writing the matrix on 
the left-hand side as a product of the well-known generators 
of the group GL(2, Z). 

Brouncker’s method for solving Fermat’s problem follows 
the same pattern. Let N be a positive integer, not a square; 
assume that the equation U? — NX* = +1 has a solution 
(u, x). Let n be the integer such that n? < N < (n+ 1)?; this 
gives u > nx, so that we may put u = nx + y withy > 0. Put 
at the same time A = N — n*, B = n, C = 1. Then (x, y) 
is a solution of the equation 


(10) AX? — 2BXY - CY? = +1 


where A, B, C are positive integers such that B? + AC = N. 
This will now be transformed by successive steps into equa- 
tions of the same form with smaller and smaller solutions, 
leading (just as in Bachet’s method) to the solution of the 
original problem. 

Brouncker’s procedure is to write x = my + z, where m 
is the largest integer less than the positive root of the equation 
At? — 2Bt — C = 0; he seems tacitly to assume that, for 
this choice of m, z must be >0 and <y. Then (jy, z) is the 
solution of an equation similar to (10), and the procedure 
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may be continued until one finds an equation with an obvious 
solution (u, 1); as this happens in all the numerical cases 
which Wallis and Brouncker used to test their method, in- 
cluding those which Fermat had specifically mentioned, they 
saw no need for digging deeper. Fermat knew better. 

In order to fill the gaps in the above treatment, it is nec- 
essary to point out the special features of (10) which permit 
the successful application of Brouncker’s process. For brev- 
ity’s sake, we will adopt a modified Gaussian notation, writing 
(A, B, C) for the quadratic form in the left-hand side of 
(10), where it is understood that A, B, C are positive integers 
and not 0; moreover, also by a slight modification of Gaussian 
usage, we will say that (A, B, C) is reduced if the positive root 
of At? — 2Bt — Cis >1 and the negative root is > — 1; this 
is so if and only if |A—C]| < 2B, and then (C, B, A) is also 
reduced. For A = N — n*, B = n, C = 1, (A, B, C) is 
obviously reduced. Now, for any reduced form F = 
(A, B, C), let again m be the largest integer less than the 
positive root of Af? — 2Bt — C. If we put 


A' = —(Am?—2Bm-C), B' = Am — B, C’ =A, 


m is the largest integer for which A’ > 0; it is then easily 
seen that B’ > 0, that F’ = (A’, B’, C’) is again reduced, 
and that the substitution 


xX = mX' + Y'~ Y = x’ 


changes F into —F’. If N = B* + AC is the “determinant” 
of F, F’ has of course the same one. As there can be no 
more than finitely many forms (A, B, C) with a given de- 
terminant N, it follows that the successive quadratic forms 
F, F', etc. produced by Brouncker’s process must eventually 
repeat themselves. This would surely have been noticed by 
Wallis and Brouncker if only they had pushed the treatment 
of their numerical examples a little further than they did. 

Notations being as above, we will say that F’ = (A’, B’, C’) 
is the form derived from F = (A, B, C). As any example 
will show at once, the form (C, B, A) is then the one derived 
from (C’', B’, A’); indeed we have 


C = —(C'm?-2B'm—A’), B= C'm—B', A=C', 
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and m is the largest integer for which C’m*> — 2B’m — A’ <0. 
Now, starting from any given reduced form Fy = (Ao, By, Co), 
we get by Brouncker’s process a sequence F, = Fo, Fy = Fy, 
etc., where for each i the form F; = (A;, B;, C;) is the one 
derived from F;_,; in an obvious sense, we may say that F; is 
the 7-th derived form of Fy. All of these are reduced forms 
with the determinant N; as we have seen, they must repeat 
themselves, so that we have F;,, = F; for some 2 2 0 and 
some p = 1. For each 2, put G; = (C;, B;, A;); then Gp is the 
i-th derived form of G;, and G, is the i-th derived form of 
Gi+p. AS Fiz, = Fi, we have G;,, = G;, and therefore G, = 
Go, F, = Fo; then, for all k, we have Fy, = Fo. We have thus 
proved that Brouncker’s process leads eventually to a repetition 
of the initial form Fy, and to a periodic sequence of forms 
from then on; if the starting point was Brouncker’s, i.e., the 
form (N —n*, n, 1) we must have C, = 1 for some p > 1, and 
therefore A,_; = 1. Consequently, the equation F,.; = 1 has 
the trivial solution (1, 0), from which, running the process in 
reverse, one deduces a solution of the original problem. As 
the sign of the right-hand side, in the successive equations 
F(X, Y) = +1, alternates from one equation to the next, it is 
+(—1)'*' if one has started originally from U? — NX? = +1. 


Therefore the solution (1, 0) for F,-, = 1 gives a solution 
(u, x) for the equation U? — NX? = +] if p is even, and for 
U? — NX? = —1if pis odd. In the latter case, in order to get 


a solution for U* — NX* = + 1, one has only to run the process 
through to F2,, or more simply (what in fact amounts to the 
same) apply the basic identity (5) to the case A = —N,x = 
z,y = t. In fact, we have seen in Chap.I, §1X, how the Indian 
mathematicians used the identity (5) (the so-called bhdvand) 
for shortcuts in the cakravdala process whenever a solution for 
U* — NX? = —1, +2, +4 has been found. As Wallis and 
Brouncker also discovered, precisely the same shortcuts can 
be applied to Brouncker’s method whenever, in running the 
process through, one finds a form F; with A; = 2 or 4. They 
also noticed that the whole process may sometimes be shortened 
by making (with the same notations as before) the substitution 


X = (m+1)xX' — ¥', Y = x’ 
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instead of the one used above; in modern terms, this amounts 
to building up a so-called “half-regular” continued fraction 
for VN instead of the “regular” one; in the cakravala method, 
as described in Chap.I, 8IX, it corresponds more or less to 
the prescription that at each step the integer denoted there 
by N - x” should be made small in absolute value, rather 
than small and >0. As pointed out at the time, such shortcuts 
can be numerically useful but make the theoretical discussion 
much more cumbersome. 

The facts underlying the above treatment can be observed 
quite easily on each numerical case to which Brouncker’s 
process is applied. Wallis and Brouncker treated among 
others the case N = 13, which is quite typical and later 
became Euler’s favorite example for his description of their 
method, e.g., in his Algebra of 1770 (Eu.I-1.383). Fermat, 
writing to Frenicle in 1657, proposes the cases N = 61 and 
N = 109, adding (misleadingly, or rather mischievously) 
that he has chosen quite small numbers “pour ne vous donner 
pas trop de peine”; he must have known, of course, that the 
smallest solutions in these two cases are respectively 


(1766319049, 226153980), 
(15807067 1986249, 15140424455 100), 


and selected the values 61 and 109 precisely for that reason. 

A final step is still required in order to show that the 
process leads to all solutions, or, in modern terms, that the 
group of units of a real quadratic field is generated by the 
one obtained by the above method. For this, one has only 
to start again (as we did heuristically, and as Wallis and 
Brouncker did without looking any further) from a given 
solution (u, x) of U? — NX® = +1. This was transformed 
by the substitution u = nx + y into a solution (x, y) of the 
next equation F(x, y) = ¥1, then into a solution (x’, y’) of 
the next equation F’(x', y') = +1 by the substitution 
x = mx’ + y'. Experience shows that this leads to smaller 
and smaller pairs of integers, and ultimately to a pair (x, 1) 
and then to (1, 0), which is where the process terminates; 
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but, for this, one has to know that 0 < x’ <x,0<y' <y, 
etc. As to the initial step wu = nx + y, the assumption 
u> — Nx’ = +1 gives 


nx? — 1 <u? < (n+1)*x’ + 1; 


this implies nx < u < (n+ 1)x; it implies nx < u < (n+1)x 
unless x = 1. As to the next step and all following ones, 
assume that (x, y) satisfies 


F(x, y) = Ax? — 2Bxy — Cy? = +1. 


Put f() = F(t, 1), e() = f) + y*. As the roots of f lie in 
the intervals [ — 1, 0] and [m, m+ 1], we have 


f(-1)>0, f(0)<0, fim <0, f(m+1)>0. 
As these four numbers are integers, this implies 
g(—1I)>0, 90) <9, om) <0, g(m+1)>0 


unless y = 1. As our assumption on (x, y) implies that x/y 
is a root of gy, this proves that x/y lies in the interval 
Im, m+ 1], as we wished to show. 

One more point deserves mentioning. In 1654, writing 
to Pascal about his “inventions” on “numbers”, Fermat cites 
a “general rule” for finding the representation of any prime 
q of the form 4m + 1 as a sum of two squares (cf. above, 
§IX), but, as usual, gives no details. However, the application 
of Brouncker’s process to N = gq, and for instance to any 
one of the cases N = 13, 61, 109, shows at once that a form 
F, with A; = C, will occur in the sequence of reduced forms 
produced by that process; then we have g = B; + A}. Thus, 
for N = 13, one finds Fy = (3, 2, 3); for N = 61, Fs = 
(5, 6, 5); for N = 109, F; = (3, 10, 3). This could well be 
Fermat’s “general rule”. 

A more careful observation would show even more: in 
those cases, the period (Fo, F), ..., F,-1) in that sequence of 
forms has p = 27 + | terms, with) as above. A formal proof 
for this can be given as follows. Let (u, x) be the smallest 
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solution for u? — gx” = 1; here u must be odd and x even, 
so that we may write 


ut] url _ x\? 
9 Q Ao) ° 


The factors in the left-hand side are mutually prime since 
they differ by 1, and g must divide one of them; therefore 
one of them is of the form v’, the other of the form gt’, and 
we have v? — gt? = +1. The sign cannot be +, since then 
(u, x) would not be the smallest solution of u® — gx? = 1. 
This gives v? — gi? = —1. As we have seen before, this 
implies that the number p of terms in the period 
(Fo, Fy, «.., Fpy-1) is odd; put p = 27 + 1. Then Fy.) = Fo, 
Cy; = 1 and consequently Aj; = 1. Now observe that, if 
a form (1, B, C) with the determinant N = B® + Cis reduced, 
we must have -1 <B—VN <0,B=n,C =N — n?;in 
particular, we get F,;, = (1, n, N-—n’). As before, put G; = 
(C;, B;, Ai) for all 7; we get Ga; = Fo. As we have seen, G, is 
the j-th derived form of Gy;, te., of Fo; it is therefore the 
same as F,, and we have A; = C;, as was to be proved. 

It does not seem unreasonable to assume that this may 
have been, in general outline, what Fermat had in mind 
when he spoke of his proofs in his letters to Pascal and 
Huygens; how much of it he could have made explicit must 
remain a moot question. Perhaps one of his most serious 
handicaps was the lack of the subscript notation; this was 
introduced later by Leibniz, in still imperfect form, and its 
use did not become general until rather late in the next 
century. Faced with a difficult or complicated proof, Fermat 
might well be content with a careful analysis of some typical 
numerical cases, convincing himself at the same time that 
the steps involved were of general validity. Actually he never 
succeeded in writing down his proofs. Were they such as 
Fermat himself, in his criticism of Wallis (cf. 8111), has insisted 
that they should be in order to provide “the foundations 
for a new branch of science”? This is what we shall never 
know. 
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SXIV. 


Now we come to item (v) of §XI. Just as Euler served us 
as a guide for looking at the forms X*°+ AY* for A = I, 2, 3 
and Wallis and Brouncker in discussing “Pell’s equation”, 
here we shall enlist the services of Lagrange (Lag.11.390— 
399). 

We start from the equation ax” + by’ + cz” = 0, to be 
solved in rational numbers; a, 6, c are assumed not to have 
the same sign. Multiplying with c, and replacing z by z/c, 
we can rewrite this as 27 = Ax + By’, where A and B are 
not both negative; they may be assumed to be squarefree, 
for otherwise, if e.g. A has a divisor m* > 1, we can replace 
x by x/m. We may take notations so that |A] < |B]. It is of 
course enough to look for solutions in integers; in fact, we 
may require x, y, z to be mutually prime; for, if e.g. x and 
z have the common divisor d > 1, d? will divide z? and Ax’, 
hence By’; as B is squarefree, d must then divide y, and we 
can replace x, y, z by x/d, y/d, z/d. 

As any common divisor of x and B must now divide z, 
x must be prime to B, so that it has an inverse x’ modulo |B}. 
As Ax? = 2? (mod |B]), we have A = (x'z)® (mod |B]). This 
shows that the given equation can have no solution unless 
Ais a quadratic residue (not necessarily prime to B) modulo 
|B|; whether this is so or not can be determined by trial and 
error if the more sophisticated methods based on the law 
of quadratic reciprocity are not available. 

Now assume that A is such; put A = a” (mod |B)). After 
replacing a either by the remainder r of the division of a 
by |B| or by |B] — r, whichever is smaller, we may assume 
that 0 <a < 3[Bj. As A is squarefree, we have A # a” unless 
a = +1,A = 1. Discarding this case as trivial, we can write 
a’ — A = BB,d’, where d is taken so that B, is squarefree. 
We have 


a’—A 
B 


and therefore |B,| < |B| except in the trivial case B = +1, 


|B,| < <8 +1 
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A = +1. On the other hand, our basic identity (5) of §XII 
gives 


B,(Bdy)* = (a? — A)(z?— Ax”) = (az+Ax)® — A(ax+2z)’. 


If we put X = ax + z, Y = Bdy, Z = az + Ax, this shows 
that (X, Y, Z) is a solution of Z2 = AX? + B,Y?. 

As A = a’ (mod |B,|), this process can now be repeated 
until we reach an equation of the form Z? = AX? + B,Y* 
with |B,| < |A]. We can then put A’ = B,, B’ = A; in order 
to proceed further, A’ must be a quadratic residue modulo 
|B'|. Continuing the descent in this way, with steadily de- 
creasing values of the coefficients A and B, we must either 
reach an equation of the form Z? = MX® + NY? where M 
is not a quadratic residue modulo |N|, in which case the 
original equation itself has no solution, or an equation 
Z? = X* + NY? with the obvious solution (1, 0, 1); in the 
latter case, one obtains a solution of the original equation 
by reversing the process. 

It will be apparent that this is a typical method “by descent”, 
quite in conformity with Fermat’s general principles; in fact, 
the underlying ideas are quite close to those described in 
§XIII for dealing with Pell’s equation (and even more so, 
perhaps, to the Indian procedure described in Chap.I, 81X). 
Here again we have no evidence for believing that Fermat 
proceeded in this manner, except for his statement that he 
had solved the problem and for the fact that it fits in rather 
well with what we know of his methods. 

We are in a far worse position if we attempt to deal with 
his famous statement about every number being a sum of 
“three triangles, four squares, five pentagons, etc.” (cf. above, 
88V and XI(i)). It occurs as early as 1638 (Fe.11.65), at a 
time when it seems most unlikely that he could have had 
any kind of proof for it. Writing to Pascal in 1654, he men- 
tions it as being so far his most important result (Fe.1I.313), 
adding merely that the proof rests upon his theorems on 
sums of two squares. In his Diophantus (Fe.1.305, Obs.X VII), 
he writes that “it depends upon the deepest mysteries” in 
number theory, to whose exposition he plans to dedicate a 
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whole volume. Writing for Huygens in 1659 (Fe.11.433), he 
repeats only the assertion about four squares, omitting the 
other parts of his earlier statement; should one attach some 
significance to this omission? This time he adds that the 
proof is by showing that “if some number were not of this 
kind” (i.e., not a sum of no more than four squares), “there 
would be a smaller one with the same property, etc.” Indeed 
proofs of that kind are known (cf. Chap.III, §XI); they are 
more or less similar to the corresponding proofs for sums 
of two squares, but they presuppose a knowledge of Euler’s 
identity on the product of sums of four squares, which is 
not an easy one to discover; certain it is that there is no 
trace of it in Fermat’s writings. 

As to “triangles” and sums of three squares, the mystery 
is even deeper. That every number is a sum of three “tri- 
angles” amounts to saying that every integer of the form 
8n + 3 is a sum of three squares; from this, as Legendre 
showed in his Théorie des Nombres (3° éd.,t.11, pp.33 1-356), 
one can derive (not quite easily, but at any rate elementarily) 
all of Fermat’s further assertions about “four squares, five 
pentagons, etc.” Sums of three squares are also mentioned 
in a marginal note to Diophantus (Fe.I.314—315, 
Obs.XXVII); here the question raised by Diophantus is 
whether an integer N = 3n + | is a sum of three squares 
(in integers, or perhaps in fractions); for this to be so, Fermat 
Says, it is necessary (and by implication sufficient) that n 
should not satisfy any of the congruences 


n=(1444+42 4-4 4'~') + 2 x 4 (mod 8 x 4’). 


This note has all the appearances of belonging to an early 
period of Fermat’s career. His condition amounts to saying 
that the number N = 3n + 1 must not be of the form 
4’ (8m + 7), which is indeed the condition for N to be asum 
of three squares; Fermat’s knowledge of this fact can only 
have been based “on induction”, since as late as 1658 he 
was unable to prove that 2p is a sum of three squares if p 
is a prime of the form 8n — 1; this latter fact is stated as 
“true in the sense of Conon”, i.e., conjectured but still un- 
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proved, in his last communication to Wallis (Fe.I1.405). To 
us, the chief interest of the above-mentioned observation 
on Diophantus is that it gives in substance the 2-adic 
expansion 

tet sy gp eg 4 4 (mod 4) 

3 1-4 

But, as to “sums of three triangles”, as long as no proof is 
produced which could with some plausibility be attributed 
to Fermat, the mystery must remain. 


§XV. 


If one takes literally the terms of Fermat’s communication 
of 1659 to Huygens, one has to conclude that he took up 
such diophantine problems as x° + y> = 2,27 + 2 = y¥, 
x’ + 4 = y’ ata comparatively late stage of his career. The 
first one of these, however, occurs already in a letter to 
Mersenne in 1638 (Fe.1J.65), and again, along with a few 
more of the same kind, in 1640 (Fe.I1.195), with the clear 
implication that these equations have no solution except 
the obvious ones. In 1659 his memory may have been at 
fault; more probably he wishes only to refer to the time 
when he had obtained for these questions a reasonably sat- 
isfactory treatment, rather than to what, even in 1640, may 
have been a mere conjecture based on plausibility arguments. 
Certainly a period of intensive work on diophantine equa- 
tions of genus | began for him around 1641, when problems 
about pythagorean triangles were proposed to him by Fre- 
nicle (“for whose love”, he once said, he worked “on numbers”; 
cf. Fe.11.241,244,265); of course both of them must have 
known their Diophantus in and out. Eventually, as we learn 
from Digby (Fe.I1.362), Frenicle’s interest in such matters 
slackened; fortunately Fermat’s did not, or at least his Jesuit 
correspondent Billy, by dint of skilful questioning seasoned 
with effusions of lavish praise, succeeded in drawing him 
out long enough to obtain from him the material for what 
later became his Inventum Novum (cf. above, 81). 
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As we have observed in Chap.I, §X, the most significant 
problems in Diophantus are concerned with curves of genus 
0 or 1. With Fermat this turns into an almost exclusive 
concentration on such curves. Only on one ill-fated occasion 
did Fermat ever mention acurve of higher genus, and there 
can hardly remain any doubt that this was due to some 
misapprehension on his part, even though, by a curious 
twist of fate, his reputation in the eyes of the ignorant came 
to rest chiefly upon it. By this we refer of course to the 
incautious words “et generaliter nullam in infinitum potestatem” 
in his statement of “Fermat’s last theorem” as it came to be 
vulgarly called: “No cube can be split into two cubes, nor 
any biquadrate into two biquadrates, nor generally any power 
beyond the second into two of the same kind” is what he wrote 
into the margin of an early section of his Diophantus 
(Fe.1.291, Obs.II), adding that he had discovered a truly 
remarkable proof for this “which this margin is too narrow 
to hold”. How could he have guessed that he was writing 
for eternity? We know his proof for biquadrates (cf. above, 
§X); he may well have constructed a proof for cubes, similar 
to the one which Euler discovered in 1753 (cf. infra, §XV1); 
he frequently repeated those two statements (e.g. 
Fe.11.65,376,433), but never the more general one. For a 
brief moment perhaps, and perhaps in his younger days 
(cf. above, §IIT), he must have deluded himself into thinking 
that he had the principle of a general proof; what he had 
in mind on that day can never be known. 

On the other hand, what we possess of his methods for 
dealing with curves of genus | is remarkably coherent; it 
is still the foundation for the modern theory of such curves. 
It falls naturally into two parts; the first one, directly inspired 
by Diophantus, may conveniently be termed a method of 
ascent, in contrast with the descent which is rightly regarded 
as more peculiarly Fermat’s own. Our information about 
the latter, while leaving no doubt about its general features, 
is quite scanty in comparison with Fermat’s testimony about 
the former (e.g. Fe.1.291-292, Obs.II]; Fe.1.297-299, 
Obs. VITI-IX;  Fe.1.322-325, Obs.XXX; Fe.1.328—329, 
Obs. XXXIV; Fe.1.334—-339, Obs.XLITI—XLIV) and with the 
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abundant (and indeed superabundant) material collected 
by Billy in the Inventum Novum (= Fe.I1I.325—398). 

In modern terms, the “ascent” is nothing else than a 
method for deriving new solutions, and in most cases in- 
finitely many, from a few “visible” solutions for the equations 
of a curve of genus |. What was new here was of course 
not the principle of the method; it had been applied quite 
systematically by Diophantus (cf. Chap.I, §X), and, as such, 
is referred to, by Fermat (loc.cit.) as well as by Billy, as “meth- 
odus vulgaris”, i.e., as the traditional method (cf. also Bachet’s 
prolix commentary to Dioph.V1.24,). The novelty consisted 
in the vastly extended use which Fermat made of it, giving 
him at least a partial equivalent of what we would obtain 
by the systematic use of the group-theoretical properties of 
the rational points on a standard cubic (cf. infra, Appendix 
III). Obviously Fermat was quite proud of it; writing for 
himself in the margin of his Diophantus (Fe.1.334, Obs.XLIID), 
he calls it “nostra inventio”, and again, writing to Billy: “at 
has astonished the greatest experts” (“C’est une de mes inventions 
qui a quelquesfois estonné les plus grands maistres, et particuherement 
Monsieur Frenicle ...”; Fe.11.437). The record shows that 
Fermat treated in this manner the following types of prob- 
lems, most of which occur already in Diophantus: 

(1) “Double equations” of the form 

Ax? + Bx + C = wv’, A’x? + B'x + Cl = v0, 
provided either A and A’, or C and C’, are squares; it may 
be assumed that the left-hand sides have no common zero 
on the projective straight line, since otherwise this would 
define a curve of genus 0 (for these, cf. §XIV above). For 
our purposes, we will regard the curve as embedded in 
projective space, the homogeneous coordinates being 
(1,x, u,v). IfA = a’, A’ = a”, we have four rational points 
at infinity 


P... = (0,1, +a, +a’). 


Similarly, if C = c*,C’ = c’®, we have four “trivial” solutions 


Q. + = (1, 0, ze, +c'), 
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Of course, in Fermat’s eyes, just as in Diophantus’s, these 
are no “solutions”. The traditional method, inherited from 
Diophantus (cf. Chap.I, §X), was to write, e.g. in the case 
A=@,A' =a": 


(a'ut+av)(a'u-—av) = B"x + C" 
with B” = A’B — AB',C" = A’C — AC’, and then 


1 
aut+av =t, a'u—- av = 7 Bix +0), 


l ( B’x a 
“u= sai t+ . 
2a t 


Substituting the latter value for u into the first one of the 
given equations, and taking t = +%, with fy) = B"/2aa’, we 
obtain a linear equation for x, hence a solution. To clarify 
this in modern terms, consider, on the given curve, the 
function 

f= aut av — bt; 


it is easily seen that this has simple poles at P, , and P__, 
no pole at P_,, and a zero at P,_; in other words, one 
has constructed the point M given by the equivalence 
M~P,. + P__ — P,- (cf. Appendix I); om for 
t = ~—ty, one gets the point M’ ~ P,, + P__ — P_ 

the case C = ¢*, C’ = ce’, the construction is of course suite 
similar. If all of A, A', C, C’ are squares, Fermat has an 
additional construction, based (in modern terms) on the 
functions a’(utc) + a(vtc'), which produces the points 
equivalent to P,, + P__ — Q4,, ete. 

Now, after obtaining, by any of those means, a rational 
point M = (xo, uo, vo), Fermat substitutes x + x9 for x, ob- 
taining a new pair of equations where the constant terms 
are squares, after which the process can be started again, 
and so on indefinitely; this, he says, is “his invention”. Of 
course it could not escape him that in some special cases 
the process may lead again and again to the same solutions; 
this corresponds to cases where one has started with points 
of finite order in the group of solutions. But usually he 
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does not hesitate to assert that he is getting infinitely many 
solutions (cf. e.g. Fe.11.248—249,259-260,263), and invariably 
he is right. 

(II) Next, consider the equation 


Ax* + Bx? + Cx? + Dx + E = 9°; 


in modern terms, we may regard it as defining a two-sheeted 
covering of the projective line; we assume of course that 
the left-hand side has no multiple root, and also that A or 
E is a square. If A = a”, there are two rational points P,, 
P_ at infinity, respectively given by the expansions 


y= ta? t+bxtcot+-) 


with b = B/2a,c = (4AC ~ B*)/8Aa. The “methodus vulgaris” 
is then to substitute for y, in the given equation, the value 
y = ax’ + bx + c, which leads to a linear equation in x. In 
the language explained in Appendix II, this amounts to 
considering the function y — ax* — bx — c, which has a 
double pole at P_ and a zero at P,; its remaining zero is 
then M ~ 2P_ — P,. Similarly, if E = e”, we have the two 
points Q. = (0, +e); putting y = a’x” + b’x +e with suitable 
a' and b', we get a point M’ ~ 2Q_ — Q.. If we have both 
A =a’ and E = e’, then, following Fermat, we can use the 
substitution y = +ax® + mx + e, with m chosen so. that the 
equation obtained in this way, after division by a power of 
x, becomes linear in x; this leads to points ~2P. — Q., 
2Q. — P.. The process can then be iterated as above in 
(I), substituting x + xo for x after a solution (xo, yo) has 
been found. 
(III) Similar methods can be applied to an equation 


Ax’? + Bx? + Cx + D=y’, 
where one assumes that D = d’, or to an equation 
Ax? + Bx’ + Cx + D =y¥’, 


where either A or D is a cube. In these cases, the method 
amounts to the familiar one of intersecting a cubic with the 
tangent to the curve at a given rational point. Was Fermat 
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aware of this geometric interpretation? In his work on an- 
alytic geometry, he had dealt with the question of con- 
structing the tangent to a plane curve given by its equation, 
and had treated in particular the “semicubic parabola” 
ay = x, the “cissoid” x(x?+y°) = ay’, and the “con- 
choid” (x*+y*)(y—a)? = by? (cf. Fe.1.159-161 and 218). 
For lack of evidence, this intriguing question must remain 
unanswered. 

On the other hand, it is noteworthy that the simple idea 
of intersecting a cubic with the straight line through two 
previously known rational points does not seem to have 
occurred to Fermat, either in geometric or in algebraic garb. 
One finds it, apparently for the first time, and stated in full 
generality, in some manuscript notes of Newton’s 
(New.IV.112—114), presumably written not long after the 
appearance of Samuel Fermat’s Diophantus of 1670 (the vol- 
ume which contained Billy’s Inventum Novum), and possibly 
inspired by that publication. 

(IV) The following problem is a mere variant of (I), into 
which it can be transformed by a simple change of variable, 
and it would hardly deserve a separate mention if Fermat 
did not seem to have been rather proud of his way of handling 
it (cf. Fe.1.334, Obs. XLII, and Jnv.Nov.I1.1 = Fe.111.360). 
It consists of the “triple equation” 


Ax + B=u, A'x + B' = v’, A"x + B” = w* 


where B, B’, B” are squares; in the projective space with the 
coordinates (1, u, v, w), it defines a space quartic with eight 
“visible” rational points. If B = 6°, one puts x = 2b¢ + A®’, 
u = b + At, after which one has a “double equation” for 
which the known rational points have to be used according 
to the recipes explained above. 

It will be noticed that in all this there is no number theory; 
everything would be equally valid over any field, in particular 
the field of real numbers. In Fermat’s eyes such work has 
therefore to be classified as “geometry”; we would describe 
it as algebraic geometry, just as the work of Diophantus 
and that of Viéte of which it is an extension; as to this, cf. 
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Chap.I, §X, and Fermat’s comments, quoted there, from 
his challenge of 1657 to the English mathematicians 
(Fe.11.335). In dealing with these questions, Fermat had a 
twofold motivation. 

Firstly, while Diophantus had usually been content with 
one solution for any given problem, he had occasionally 
asked for more; in Dioph.V.8,, for instance, he gave a recipe 
for finding three pythagorean triangles having equal areas, 
and Viete, in his Zetetica (Zet.1V.11 = Op.p.70—71), translated 
it into algebraic terms. If the triangle is taken to be, up to 
a proportionality factor, (2pq, p?—q°, p?+4q°), then, putting 
either x = q/p orx = p/q, and calling A the area, the problem 
amounts to finding a curve |x ~x°] = Ay’ with three rational 
points. Diophantus’s procedure is to find a solution for 
x—- x =x’ — x? ie, x? + xx’ + x’ = 1, thus providing 
two triangles of area A = x — x°, and then observing that 
x + x’ is a solution for A = X* — X; we would interpret 
this by saying that, since (x, 1) and (x’, 1) are on the cubic 
AY? = X — X°, the third intersection of this cubic with Y = 1, 
which is (—x—x’, 1), must also be rational; there is no 
indication, however, that Diophantus, Viéte or even Fermat 
saw in this an application of an underlying general principle. 
What Fermat does, in contrast with Diophantus and Vieéte, 
is to start from just one pythagorean triangle, so as to get 
acurve x — x° = Ay* with one known rational point, then 
to intersect the cubic with the tangent at that point and 
iterate the process as often as desired; this, at least, is de- 
scribed by him as his basic method, but (not surprisingly) 
he has also various ways of reducing the same problem to 
a “double equation” and then applying to it the procedure 
described under (I) (cf. Fe.1.309-311, Obs.XXIII, and 
Inv.Nov.1.38 = Fe.111.348). 

There was another reason why Fermat could not be con- 
tent with the traditional method; it is that this “methodus 
vulgaris” often leads to solutions in negative numbers; in 
the traditional view, these are no solutions at all. Take for 
instance the problem (apparently raised and perhaps solved 
by Diophantus in his lost Porisms; cf. Dioph.V.16) of writing 
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the sum or the difference of two given cubes as the sum or 
the difference of two other cubes; Viéte dealt with it in his 
Zetetica (Zet.1V.18-20 = Op.74-75); Bachet reproduced 
Viéte’s treatment, including even his numerical examples 
(and without any acknowledgment; cf. Chap.I, $X1J), in his 
commentary to Dioph.IV.2. The problem is of course to 
derive a second rational point (x, y) from one known point 
(a, b) ona curve X* + Y* = A, except that Viéte and Bachet 
split it up into various cases according to the signs of a, b, 
xX, y. 

Here Viéte, and Bachet after him, are content with the 
“methodus vulgaris’; in effect, they intersect the curve with 
its tangent at (a, 5); this gives the solution 

a’ + 26° b 2a + b° 


x = aes = ——_—— 
ob’ J Pa 


which, from their point of view, is to be regarded as satis- 
factory only if x, y have the prescribed signs when a, b have 
prescribed signs. For instance, if the problem is to find two 
cubes whose sum is the difference of two given cubes p’, 
q, one would have to take, in our present notation, a = p, 
b = —q,p > q, and require x, y to be positive, so that Viéte’s 
formulas solve the problem if p* > 2q¢° but not otherwise. 
Also, as Fermat points out, they can never solve the problem 
of finding two cubes whose sum is the sum of two given 
cubes. Fermat’s prescription for this batch of problems 
(Fe.I.291—292, Obs.IH; 1.297—299, Obs. VILI-IX) is to iterate 
Viéte’s procedure as many times as may be needed in order 
to obtain a solution as required, and then again as often as 
necessary if further solutions are desired.’ Here, too, he 
gives no formal proof for the fact that this will always be 


° Presumably Fermat did not know that the same idea had already 
occurred to Albert Girard on the occasion of the latter’s French translation 
of Books V and VI of Diophantus; cf. Les Zuvres Mathematiques de Simon 
Stevin . . . reveu, corrigé et augmenté par Albert Girard, Leyde, Elsevier 1634, 
pp. 159-160. 
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successful, but seems content with his experience in such 
matters. 
As to the first discovery of this “method of ascent”, we 


have Fermat’s testimony in his letter of 1644 to Carcavi 
where he refers to the following problems about pythagorean 
triangles (i.e., triples of integers a, b, ¢ such that a® + 6? = 
c*) proposed to him at one time (“autrefois”) by Frenicle (cf. 
Frenicle’s letter of 1641 to Fermat, Fe.11.241) and which 
had formerly eluded both of them; “je ne voyois pas méme 
de voye pour y venir”, he writes to Carcavi (“I saw no way of 
even tackling them”); but now, in 1644, he can solve them 
(Fe.11.265): 

(A) Find a triangle for which a > 6, and (a—6)? — 26? is 
a square; 

(B) Find a triangle such that c and a + b are squares. 

As Frenicle had indicated, they are closely related. To 
see this, put x = a/b in (A) and observe that the problem 
amounts to the “double equation” 


(11) r+l=aw, x-2-l=v 


to be solved in rational numbers subject to the condition 
x > 1. As to (B), call d the g.c.d. of a, b, c, and write (cf. (2), 
Chap.I, §V) 


a b ; C 
(12) a’ == = 2pq, b=sa=p-9qc =5=p +9 


where p is prime to 4, p > q, and p — q is odd. Call 6 the 
g.c.d. of c’ and a’ + 6’; & divides a’ + b’ + c’',i.e., 2p(p+q) 
and 2q(p—q), whose g.c.d. is 2; asc’ is odd, 8 is 1, so that d 
is the g.c.d. ofc anda + 6. Now, ifcanda + bare squares, 
d must be a square, and so are c' and a’ + 6’; moreover, 
if p is prime to q, pf? + g* cannot be a square unless 
p — gis odd. Putting x = —p/q, we find that (B) amounts 
again to the double equation (11), but with the side-condition 
x<-1. 

These problems, and others of the same type, occur re- 
peatedly in Fermat’s correspondence of the years 1643 and 
1644; obviously these are the years of his breakthrough in 
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such matters. To Carcavi, in 1644, he sends his solution 
(1517, 156, 1525) for (A). In 1643 he had sent to Mersenne 
his solution for (B): 


a = 4565486027761, 6b = 1061652293520, 
¢ = 4687298610289 


(Fe.11.261; cf. Fe.I1.259-260,263); this corresponds to the 
values p = 2150905, g = 246792 in (12). Problems (A) and 
(B) are those which he indicates as examples for his iterative 
method in the margin of his Diophantus (Fe.1.336-338, 
Obs. XLIV), describing them as “very difficult questions” 
which could not have been solved otherwise; the details of 
his procedures for solving (B) can be found in the Inventum 
Novum (Inv.Nov.1.22,25,45,111.32 = Fe.11I.339,340-341, 
353-—354,388—389); one of them is to observe that the prob- 
lem is the same as to make c(a+ b) a square (since one may 
assume, as shown above, that a + 6 is prime to c); then, 
putting z = p/q, one has to make (2 + 1)(¢° + 2z— 1) a square, 
which can be done by the repeated application of the last 
one of Fermat’s methods described above under (II) 
(Inv.Nov. 111.32 = Fe.111.388—389). What is more, in the 
same marginal note (Fe.I.337) he makes bold to “assert con- 
fidently” that his solutions for (A) and (B) are the smallest 
possible ones; this was indeed verified by Lagrange in 1777 
(cf. infra, SX VI, and Appendix V). 


§SXVI. 


Experience soon shows that Fermat's iterative method 
leads rapidly to very large integers. An altogether different 
method is needed when one wishes, not to find more and 
more solutions for a given problem, but on the contrary to 
show that there are none, or at least that whatever solutions 
exist can all be derived from a few known ones by some 
regular process. A typical case for such a proof has already 
been described above in §X, where it was shown that the 
area of a pythagorean triangle cannot be a square; this 
amounts to proving that x — x° = 9° has no other rational 
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solution than y = 0,x = Oor +1 (cf. above, §XV). For this, 
as we have seen, a “method of descent” is required, by which 
smaller and smaller solutions can be produced if itis assumed 
that one exists. Another example for this is given by the 
equation x* — Ny* = | (‘“Pell’s equation”) discussed above 
in §XIII; this is of genus 0, not of genus 1, but a method 
of descent was also required for its complete solution. 
For such “negative” questions, Fermat quotes four ex- 
amples in his communication of 1659 to Huygens (Fe.11.433— 
434). Strangely enough, the last of these is no other than 
his erroneous assertion about integers of the form 2°” + 1 
being primes (cf. above, §1V), which, little more than a year 
before, he had been careful to present as a mere conjecture 
(Fe.11.404). Huygens, who had been skeptical about this 
(Huy.11.212 = Fe.1V.122), took note of the discrepancy in 
the margin of the copy he took of that letter (Huy.I1.462); 
one wonders whether ill health had momentarily distracted 
Fermat’s attention at this point. As to the three other ex- 
amples, all they have in common is that they apparently 
depended in part upon Fermat’s theory of the quadratic 
forms X* + AY’ for A = 1, 2,3 and more specifically upon 
the results stated above as lemma 4 of 81X and lemma 5 of 
§XII. We shall first dispose quickly of two of these, which 
had already figured as challenge problems in Fermat’s letter 
of 1657 to Digby (Fe.1I.345) and which seem little more 
than straightforward consequences of those lemmas. In one 
case one has to show that x* + 2 = y* has no solution other 
than (5, 3) in natural integers. In fact, if (x, y) is a solution, 
lemma 5 of §XII (or else the elementary theory of the ring 
Z{V—2], for which cf. Appendix 1) shows that there are 
two integers u, v such that y = uw? + 2v*,x = u® — 6uv’, 
1 = 3u°v — 2v°. The latter formula implies v = +1, 
3u2 = 2 +1, hencev = 1,v = lLy = 3,x = +5. Inthe 
other case one must solve x* + 4 = y’. Here we apply lemma 
4 of §IX. If x is odd, we must have y = u? + v’,x = w - 
3uv", 2 = v? — 3vu?; as v divides 2, it must be +1 or +2; 
if it were +1, x would be even; if v = +2, then we have 
2 = +(8—6u’), hence u? = 1,y = 5,x = 11. On the other 
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hand, if x is even, putx = 2x’, y = 2y’; then x’? + 1 = 2y’°, 
and x’ must be odd. Put z = (x’+ 1)/2, ¢ = (x’—1)/2; this 
gives 


z-t=LyPaH2t?. 
Apply again lemma 4 of §1X; we get 


2=w-3uw", t=uv—- 3, L=z2—t= (u-v\(u?+4uvtv), 


which obviously has no other solution than u = 1, v = 0 
andu = 0,v = —1, givingz = lor0,x’ = +1,x = +2, 
y=? 


Quite different, and to us far more interesting, is the case 
of the equation x° + y* = z° to be solved in integers, or 
equivalently X 3 + Y? = | to be solved in rational numbers, 
this being the first one of the four sent by Fermat to Huygens 
in 1659. It may be regarded as a companion to the one 
discussed above in 8X, both being cases for the “genuine” 
descent as it applies to curves of genus 1; such was also the 
problem stated, along with these two, in Fermat’s com- 
munication of 1640 to Frenicle (Fe.11.195), viz., “find four 
squares in arithmetic progression”. At that early date, Fermat 
had perhaps no more than plausibility arguments for the 
fact that these problems have no solution; but eventually 
he must have obtained a formal proof also for the third 
one, since we are told so by Billy in his Inventum Novum (Inv. 
Nov.I1.11 = Fe.111.365). 

How grateful we should be to the good Jesuit, had he 
shown some curiosity towards such “negative” statements, 
instead of tediously aligning one example after another of 
Fermat’s iterative method, not a few of which must have 
been of his own devising! Some of them do not even make 
good sense. Thus we are left to reconstruct, as best we can, 
some of Fermat’s most interesting proofs by descent, and 
again, as on former occasions, to seek Euler’s and Lagrange’s 
help. In doing so one should not forget that it is in the 
nature of such proofs to admit a number of possible variants, 
one as good as the other; examples for this abound in the 
writings of Euler; an explanation of this phenomenon Is to 
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be found only in the modern arithmetical theory of curves 
of genus 1. Without going into further details, it will be 
enough to mention that such proofs invariably proceed from 
the equations of one curve of genus | to those of another 


(e.g. in §X the curves which, in homogeneous form 
2 4 4 2 


appear there as po(pt+q)(p-q) = m, x - yo = 2, 
ri + 4s = x*, ie, in non-homogeneous form as 
X>-—X=Y*,X*-1=Y°,X'+4 = Y’) and from there 
back to the original equation. The formulas by which one 
goes from one curve to the next define sometimes merely 
a birational correspondence, in which case there is no real 
progress; but mostly they define what used to be known as 
“transformation” in the classical theory of elliptic functions 
(“isogeny” in the modern terminology), and usually one of 
order 2 or 3; in special cases, such a transformation may 
be nothing else than complex multiplication, formulas for 
which are for instance implicit in the proofs of §X and in 
the one to be described below. 

Because of these circumstances, any attempt at recon- 
struction can be no more than a hit or miss proposition; all 
it can do is to show one way in which Fermat could have 
obtained his results. As to the four squares in arithmetic 
progression, the problem was treated by Euler in 1780 in 
a paper, published posthumously (Eu.I-5.56—60 in E758); 
it is somewhat confusedly written, obviously by his assistants, 
at a time when he was totally blind (cf. Chap.III). A more 
direct and more elegantly presented proof can be found 
in J. Itard, Arithmétique et Théorie des Nombres, P.U.F., Paris 
1973, p. 112. Another proof, seeking to exhibit the mech- 
anism of the descent from the point of view of the modern 
theory of curves of genus 1, will be given in Appendix IV. 

Now we turn to the equation x° + y’ = 2°; this defines a 
curve of genus | in the projective plane. Fermat has left 
no indication about his treatment of that equation, except 
to say that it was by descent. It is mentioned by Euler for 
the first time in 1753, when he writes to Goldbach that he 
has found a proof, so different from the known one for 
x‘ + y* = 2? that it leaves him no hope of extending them 
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to a general proof for n-th powers (Corr.I.618). His proof 
for cubes is mentioned by him in 1760 (cf. Chap. III, §XIV) 
and then explained in full detail in the final chapter of his 
Algebra of 1770 (Eu.1-1.486), with the sole difference that 
in 1760 he indicates that it is based on the theory of the 
quadratic form X® + 3Y?, while in 1770 he uses the field 
Q(V—3). As we have repeatedly mentioned (cf. Appendix 
I), those two modes of exposition are essentially equivalent 
and can easily be translated into one another; Fermat would 
have used the former, of course, and we shall do the same 
here; otherwise we follow Euler closely. 

One may assume that x, y, z are mutually prime; then one 
of them must be even, and the other two odd; changing 
their signs if necessary, we may rewrite the equation as 
x + y° = 2°, where z is even and x, y are odd; put x = p + 4, 
y = p — q. We have now 2p(p° + 3q°) = 2°. As p + qis odd, 
p’ + 39° is odd; as z is even, p must then be a multiple of 
4, and g must be odd. Writing our equation as 


Fp + 39°) = (:). 


we shall distinguish two cases, according as z is a multiple 
of 3 or not. If it is not, then the two factors in the above 
equation are mutually prime, so that each of them must be 
a cube: 


p=4r, p+ 3e=s. 
Apply lemma 5 of §XII; this gives 
p = u(ut 3v)(u-3v), q= 3u(u® —v*), 


where u, v must be mutually prime; as g is odd, v must be 
odd and weven. Asp = 4r’, this shows that u/4 is an integer 
and a cube, and that u + 3vandu — 3vare cubes. Putting 
now u = 4e°,u + 3u=f',u- 3u=¢,wegetfPtg= 
(2e)°; this is a solution of the original equation, obviously 
in smaller numbers. 

If z and therefore are multiples of 3, g is not; we can 
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lel) ]-oG). 


Thus p/4 must be a multiple of 9, and the product of p/36 
and q° + 3(p/3)° is a cube; as these two factors are mutually 
prime, they must both be cubes, so that we can apply lemma 
5 of §XII and write 


now write 


pb = 36r’, q = u(u? —9v*), £ = 3v(u?—v’), 


and therefore 
—4y? = v(vtu)(v—u). 


Here again wu, v must be mutually prime, u must be odd, 
and v even. Then 2v, v + u and v — u must be cubes; as 
the first one is the sum of the other two, this is again a 
solution of the original equation. Getting smaller and smaller 
solutions, one comes to a contradiction, just as Fermat said. 

Finally his “confident assertion”, quoted above in §XV, 
about his numerical solutions for problems (A) and (B) de- 
serves some comment. In view of the large numbers involved, 
particularly in the solution for (B), mere trial and error 
could of course not show that these are the smallest solutions. 
Lagrange cleared up the matter in 1777 (Lag.1V.377-398); 
after replacing the problem by the essentially equivalent 
one of solving 2x" — y* = z° in integers (cf. infra, Appendix 
V), he dealt with this by applying to it Fermat’s own method 
of descent; the details are substantially more complicated 
than in the case of Fermat's equation x* — y* = 2’, while 
the principles are the same. The conclusion, however, is 
different; instead of showing that there are no other solutions 
than the trivial ones, one finds now that all solutions can 
be derived from the obvious ones by the processes to be 
described in Appendices III and IV, or (what comes to the 
same, at least in this case) by Fermat’s processes described 
above in §XV. In modern language, the space quartic defined 
by the “double equation” (11) of §XV, and the quartic curve 
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2X* — 1 = Z?, are curves of rank 1, isogenous to one another. 
Taking into account the side-conditions implied by problems 
(A) and (B) as formulated by Frenicle and Fermat, one can 
then verify that Fermat’s solutions are indeed the smallest 
ones (cf. Appendix V). 

Had Fermat obtained such a proof? This is not inherently 
impossible, since Lagrange’s method is no other than his 
own. One rather imagines, however, that he merely allowed 
himself to be guided by the analogy with Pell’s equation, 
where the group of solutions is also, just as here, the product 
of an infinite cyclic group and a group of order 2; in that 
case he might have taken for granted the facts later proved 
by Lagrange. If so, his intuition had not deceived him. 


§SXVII. 


This concludes our analysis of Fermat’s writings “on 
numbers”. Once properly assembled, they present a re- 
markably coherent picture. 

His proofs have almost totally vanished. Writing them 
up, at a time when algebraic notation was still clumsy in the 
extreme, and models were altogether absent, would have 
cost a tremendous effort, and the complete lack of interest 
on the part of his contemporaries must have been depressing. 

In his later years, Fermat must have feared that the fruit 
of his labors would get lost. Here is the melancholy conclusion 
of his final letter to Huygens: 

“Such is in brief the tale of my musings on numbers. I 
have put it down only because I fear that I shall never find 
the leisure to write out and expand properly all these proofs 
and methods. Anyway this will serve as a pointer to men of 
science for finding by themselves what I am not writing 
out, particularly if Monsieur de Carcavi and Frenicle com- 
municate to them a few proofs by descent that I have sent 
them on the subject of some negative propositions. Maybe 
posterity will be grateful to me for having shown that the 
ancients did not know everything, and this account may 
come to be regarded by my successors as the “handing on 
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of the torch”, in the words of the great Chancellor of England 
[Bacon], following whose intention and motto I shall add: 
many will pass away, science will grow” [“Vozla sommairement 
le conte de mes resveries sur le suject des nombres. Je ne l’ay escrit, 
que parce que j apprehende que le loisir d’estendre et de mettre au 
long toutes ces demonstrations et ces methodes me manquera. En 
tout cas cette indication servira aux scavants pour trouver d’eux 
mesmes ce que je n’estens point, principalement si Monsieur de 
Carcavi et Frenicle leur font part de quelques demonstrations par 
la descente que je leur ay envoyees sur le subject de quelques pro- 
positions negatives. Et peut estre la posterité me scaura gré de luy 
avoir fait connoistre que les anciens n’ont pas tout sceu, et cette 
relation pourra passer dans Vesprit de ceux qui viendront apres 
moy pour traditio lampadis ad filtos, comme parle le grand Chancelier 
d’angleterre, suivant le sentiment et la devise du quel j’adjousteray, 
multi pertransibunt et augebitur scientia”: Fe.11.436 = Huy.II. 
461-462 | August 1659]. 

Less resignedly no doubt, but perhaps in a similar vein, 
Galois, on the eve of his death, was writing to his friend 
Chevalier: “Apres cela il y aura, j’espére, des gens qui trouveront 
leur profit & déchiffrer tout ce gachis” (“there will come those, 
I hope, who will find it worth their while to decipher all 
this mess”). 

It does not appear that Huygens reacted in any way to 
Fermat’s letter: “There is no lack of better things for us 
to do”, he had written to Wallis in 1658 (HuyII.211 = 
Fe.IV.121), soon adding, however, a more balanced appraisal 
of Fermat’s and Frenicle’s arithmetical work: “Let them 
give up their problem-setting and follow your example by 
giving us what may bring credit to them and pleasure to 
us. There are indeed admirable properties of numbers whose 
demonstration would be well worth while in my opinion; 
such, surely, is the one in Fermat’s letter at the bottom of 
page 185...” The latter reference is to Fermat’s last letter 
to Digby and his assertions on sums of squares and triangular 
numbers (Fe.11.403—404). 

Nevertheless, if Huygens was undoubtedly more capable 
than most of his contemporaries of appreciating and of 
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criticizing arithmetical work, he was not prepared to take 
up the torch proffered by Fermat. Only once did he come 
at all close to number theory; this occurred in his Descriptio 
Automat: Planetaru (Huy.XX1.587-652), first printed post- 
humously in 1703 but probably composed between 1680 
and 1687 (cf. zbid.p.585). Here, in connection with practical 
problems about automata with dented wheels, one finds a 
thoroughly original and masterly treatment of the best ap- 
proximation of real numbers by fractions, based on the 
continued fraction algorithm (zbid.pp.627—643). The topic, 
which had been already touched upon by Wallis, was even- 
tually taken up by Euler (cf. e.g. Eu.I-14.187-215 = 
E71|1737; Eu.1-15.31-49 = E281|1757; etc.), then by La- 
grange who gave full credit to Huygens, calling it “une des 
principales découvertes de ce grand géométre” (Lag.V11.43—44 
= Fu.I-1.536); it is noteworthy, however, that Huygens 
himself never bothered to get it published. 

Fermat’s torch had indeed been long extinguished when 
Euler, in 1730, picked it up, kindled it anew, and kept it 
burning brightly for the next half-century. That story will 
be the subject of the next chapter. Here we merely note 
that in 1742, after seeking for more than ten years with 
little success to clear up the questions pertaining to sums 
of two squares, Euler, at the height of his fame, wrote from 
Berlin to Clairaut in Paris, asking him to institute a search 
for Fermat’s manuscripts (Eu.I[V A-5.124; cf. Chap.UL, 81V); 
according to him, “many mysteries might still lie hidden 
there”. The answer was discouraging (ibid., p. 129); as he 
reports to Goldbach (Corr.I.168) the search was hopeless, 
since there was no interest any more for such matters (“da 
der gotit fiir dergleichen Sachen ber den Meisten erloschen 1st, so 
ist auch die Hoffnung verschwunden”). There was no remedy 
but for Euler to do it by himself. And so he did, with La- 
grange’s valuable cooperation from 1768 onwards. Indeed, 
by the time of Euler’s death, not only had all the territory 
explored by Fermat been regained, but further ground had 
been conquered. If some mysteries remained, they concerned 
questions which had lain beyond Fermat’s horizon. 
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In one direction Fermat has cast his shadow well into the 
present century and perhaps the next one. Already Leibniz 
had announced it; here is what he wrote with prophetic 
insight in 1702: 

“|. . my hope is that those will come who will spread the 
seeds of the new science [1.e., Leibnizian analysis} further 
and reap a richer harvest, especially if they attend more 
diligently than heretofore to the advancement of Diophan- 
tine algebra, which Descartes’ disciples have neglected be- 
cause they failed to perceive any use for it in Geometry. I, 
on the other hand, remember having repeatedly suggested 
(what could seem strange to some) that the progress of our 
integral calculus depended in good part upon the de- 
velopment of that type of Arithmetic which, so far as we 
know, Diophantus has been the first one to treat system- 
atically .. .” (Acta Eruditorum 1702, p. 219 = Math.Schr.ed. 
C.I.Gerhardt, (II) Bd. I, Halle 1858, p. 360). 

In modern terms, Leibniz was trying to say that the study 
and classification of algebraic differentials and their integrals 
depended upon the methods of algebraic geometry for 
which, in his days, Diophantus, Viéte and Fermat offered 
the only existing models. He mentions neither Viéte nor 
Fermat; perhaps he had never studied them; but his intuition 
was not leading him astray. 

When the same threads are later picked up, first by Euler 
and then by Lagrange, with brilliant success, the close con- 
nection between “Diophantine algebra” (as Leibniz called 
it) and elliptic integrals begins to appear below the surface. 
It may well be implicit in Euler’s oft-repeated admonition 
to his fellow-analysts that number theory is, even from their 
point of view, no waste of time. Sometimes one can detect 
in his words an echo of Leibniz’s far-sighted remark quoted 
above: 

“,.. the author has often mentioned that the Diophantine 
method, if further developed, will redound to the benefit 
of the whole of Analysis, so that he is far from feeling sorry 
for the prolonged efforts he has devoted to that branch of 
mathematics ...” (Eu.I-2.428 in E255), and further on in 
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the same paper: “this shows how much more there is left 
to do in Diophantine Analysis, which beyond any doubt 
could contribute greatly to the whole of Analysis, finite and 
infinitesimal. In fact, one main tool of integral calculus con- 
sists in transforming irrational differential expressions into 
rational ones, and this comes directly from Diophantine 
Analysis, so that one has every right to expect further prog- 
ress from that same study . . .” (zbzd.p.454). 

In that paper, written in 1754, Euler proves by the 
“Diophantine method” the rationality of the cubic surface 
x+y + 2 = #, after referring to Euclid’s proof for the 
corresponding result for the equation x? + y? = 27. As to 
curves of genus 1, Euler became interested in them quite 
early in his career (cf. Corr.1.31|1730). He deals extensively 
with them in a paper of 1738 where he rewrites in his own 
style Fermat’s and Frenicle’s treatment of the equations 
xi + y* = 2 (Eu.I-2.38-58 = E98; cf. above, §X) and where 
he deals, also by descent, with a few equations of the same 
kind, all belonging to what we would now call the “lemnis- 
catic” case. Did he immediately recognize a problem of the 
same nature when he raised, as early as 1730, the question 
of the rationality of the integral fdx/V1—x* (Corr.1.47,51), 
or at the end of 1751 when he received Fagnano’s Produziont 
Matematiche containing a number of remarkable theorems 
about the rectification of the lemniscate and about “lem- 
niscatic” integrals (cf. Chap.I, $1)? We have no way of telling; 
but Jacobi, upon getting from the St. Petersburg Academy, 
in 1834, the latest volume of Euler’s posthumous papers, 
including several on Diophantine equations of the form 


y? = A + Bx + Cx? + Dx? + Ex’, 


did not fail to observe (/ac.11.53) that Euler’s procedure in 
such problems amounted to writing the multiplication for- 
mula for the elliptic integral 
dx 
VA+ Bx+ Cx? + Dx? + Ex! 
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and added that this “memorable agreement” (“consensus tle 
memorabilis”) between both theories could hardly have es- 
caped the notice of “their common initiator”. In this, as we 
know, he was not giving due credit to Fermat, whose work 
was far less familiar to him than Euler’s; but, in matters 
pertaining solely to Euler, he was surely a good judge. He 
might have said the same about Lagrange, had a similar 
occasion arisen. Lagrange had taken up elliptic integrals in 
1766 (Lag.I1.5-33). In 1777 he wrote an important paper 
on the diophantine equations 2x* — y* = +2? and related 
problems (Lag.1V.377-398; cf. above, §XVI). He came back 
to elliptic integrals in 1784 (Lag.11.253-312) with a paper 
where he is sometimes credited with having “rediscovered 
independently” the so-called Landen transformation, Le., 
the transformation (or “isogeny”) of order 2 for elliptic 
integrals.'° Could he have been unaware that this same 
transformation had been a main tool in the infinite descent 
as practised by Fermat, by Euler, and again by himself in 
his own paper of 1777 (cf. Appendix IV)? Indeed Leibniz’s 
prophecy was coming true. 

In the nineteenth century, Fermat’s legacy was once more 
threatened with extinction. Reduced to essentials, it consisted 
of three correlated principles. One was that algebraic ge- 
ometry, and, to begin with, algebraic curves deserved to be 
studied, not just over the real or eventually the complex 
number-field, but over other fields as well, and signally over 
the field of rational numbers. Next came the principle that 
rational transformations were the proper tool in this study. 
Thirdly there was the infinite descent as it applies to dio- 
phantine problems. In the nineteenth century, however, 
the conspicuous success of the function-theoretic method 
in algebraic geometry led for a while to almost total victory 
for the complex numbers over their rivals, beginning with 


'° Cf. e.g. A. Enneper, Elliptische Functionen (Halle 1876), p. 307. Actually 
Lagrange knew Landen’s paper of 1775 on this subject; he quotes it and 
comments briefly upon it in a letter of 1777 to Condorcet (Lag. X1V.41). 
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the theory of elliptic functions which swallowed up the study 
of curves of genus | (to the point of giving them the name 
by which they are usually known at present), and continuing 
with Riemann’s theory of abelian functions. The develop- 
ment of so-called “pure” geometry, and particularly of the 
theory of projective spaces, initiated by Poncelet and de- 
veloped by Chasles, Mobius and many others, could have 
restored the balance, had it not, from the beginning, operated 
almost invariably over the field of complex numbers; this 
tradition was continued in Italian geometry and beyond it. 
During the whole period, Fermat’s name came to be asso- 
ciated almost exclusively with his “last theorem”, i.e., with 
the equation x” + y" = z”, which, however fruitful it turned 
out to be in the hands of Kummer, does not belong to our 
present subject. 

A first faint reaction set in with the paper of 1890 of 
Hilbert and Hurwitz (Acta Math.14, pp. 217-224), where 
for the first time it was pointed out explicitly that the bi- 
rational theory of algebraic curves provided the proper 
framework for the study of diophantine equations. Poincaré, 
in 1901, made a broader survey of the same ground, and, 
unwittingly perhaps, sought to make use of the trisection 
of elliptic functions as a tool for infinite descent. The true 
breakthrough came in 1922 with Mordell’s celebrated paper 
(Proc.Cambridge Phil.Soc.21, pp. 179-192); here, if Fermat’s 
name does not occur, the use of the words “infinite descent” 
shows that Mordell was well aware of his indebtedness to 
his remote predecessor. Since then the theory of elliptic 
curves, and its generalization to curves of higher genus and 
to abelian varieties, has been one of the main topics in modern 
number theory. Fermat’s name, and his method of infinite 
descent, are indissolubly bound with it; they promise to 
remain so in the future. 


Appendix I 
Euclidean Quadratic Fields 


As pointed out in §§VIII and XII, Fermat’s results on 
the quadratic forms X* + AY? (where A is 1, +2 or 3) can 
best be understood by considering the rings Z[6@] where 6 
isi = V—-1, V+2, orj = (~1+V—3)/2. This will be ex- 
plained now. 

In each one of these cases we put R = Z[@]; R is the ring 
of integers of the field K = Q(V —A). We write € > €’ for 
the automorphism of K which changes V —A into —V—A, 
and we put N(é) = &&’; N() is called the norm of €. We 
have 


N(xtyV~—A) = x? + Ay’, N(x+yj) = x — xy + 9°. 


In all cases, we have N(én) = N(€)N(y); as pointed out in 
Chap. I, §VIII, this is equivalent to the basic identity for 
the form X? + AY’, ie., to (5) and (6) of Chap. I, §VIII, 
and to (5) of Chap. II, §XII. 

The units of the ring R are the integers n of K which 
satisfy N(n) = +1; these are +1, +2 for A = 1; +1 for 
A = 2; +1, +j, +7? for A = 3. For A = —2, there are 
infinitely many units, viz., all the integers of the form +” 
with e = 1 + V2,n € Z (cf. §XII). Two non-zero integers 
a, B are called associates if each is a multiple of the other in 
the ring R; this is so if and only if a~'@ is a unit. A non- 
zero integer a = a + b@ will be called normalized, in the case 
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A= 1,ifa>0,b=0; inthe case A = 2,ifa > O0ora = 0, 
b > 0; in the case A = 3,if a > b = 0; inthe caseA = —2, 
if —-b <a <b. Among the associates of a given integer, one 
and only one is normalized; this is easily seen for A = I, 2, 3; 
for A = —2, the proof is essentially the one given by Fermat, 
and reproduced above in §XII, for the existence of a “min- 
imal” solution (x, y) for an equation n = 2y” — x°. In general, 
the product of two normalized integers is not normalized; 
this may be regarded as the source of Fermat’s error discussed 
in §XII. 

Let ¢ = x + y@ be any element of the field K; then there 
is at least one integer a = a + b6 in K, such that |N(f—a)| < 1; 
indeed this will be so if we take for a, 6 the closest rational 
integers to x and to y, respectively. Consequently, if a, B 
are two integers, and B # 0, there are integers y, p such 
that a = By + pand |N(p)| < |N(B)|; to obtain this, one has 
only to take y so that |N((@/B) — y)| < 1. Then the “euclidean 
algorithm” shows, just as in Book VII of Euclid, the existence 
of a g.c.d. 6 for any two integers a, 8 in the ring R; as in 
Euclid, this is defined as a common divisor of a and 6 such 
that every common divisor of a and B divides 6; clearly any 
two such integers must be associates, and there is one and 
only one normalized g.c.d. to a, 8. Instead of the “euclidean 
algorithm”, one may also take for 6 anyone of the numbers 

= af + By for which |N(y)| has the smallest value when 
one takes for € and 7 all the elements of R; the process of 
“euclidean division” shows then that any such integer is a 
g.c.d. for aw and B. 

In the ring R, primes are defined as those integers, other 
than 0 and the units, which have no other divisors than 
their associates and the units. From the existence of the 
g.c.d. it follows that a prime that divides a product must 
divide one of its factors. If a is an integer, and |N(a)| > 1, 
let 7 be one of its divisors for which |N(z)| has its smallest 
value > 1; this must be a prime. From this it follows at once 
that every integer in R is a product of primes and a unit, 
and that its representation as such a product is “essentially” 
unique (i.e., unique up to unit factors); it is unique, up to 
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the order of the factors, if one stipulates that its prime 
factors should be normalized. 

To find the primes in the ring R, one needs to know the 
quadratic residue character of —A modulo the rational 
primes; this has been determined above in §VII (for A = — 1) 
and in §XII (for A = +2,A = 3). Let p bea rational prime; 
if it is not a prime in R, it has in R a prime divisor 77, which 
we may assume to be normalized, and N(z) is then a divisor 
of N(p) = p*, other than | and p’ and therefore equal to 
+p. Now consider separately the four cases: 


(a) ifA = 1, putw =x + yi; thenp = x° + y*, andp=1 
(mod 4) unless p = 2; in the latter case, 7 = 1 + 2. 

(b) if A = 2, put 7 = x + yV—2; then p = x* + 2y, and 
p = 1 or 3 (mod 8) unless p = 2; in the latter case 
n= V-2. 

(c) ifA = —2, putw =x + yV2, where —y <x < y; then 
p = 2y? — x* and p = 1 or 7 (mod 8) unless p = 2; in 
the latter case 7 = V2. 

(d) if A = 3, puta = x + y; then p = x° — xy + »*. Here 
x, y cannot both be even, so that, among the associates 
of 7, one (viz. 7 itself if y is even, 77” if x is even, and 
aj if x and y are odd) must be of the form z + t V-3 
with integers z, ¢, so that p = z° + 3°; we have pHl 
(mod 3) unless p = 3. In the latter case, p has the prime 
divisor V — 3 (or in normalized form 2 + 7 = —jV—8). 


If, as above, p is not a prime, and 7 is a prime divisor of 
p, so is 7’; it is easily verified that 7’ cannot be an associate 
of 7 unless p = 2 (in cases (a), (b), (c)) or p = 3 (in case 
(d)). Leaving the latter cases aside, p is in each case a “divisor 
of the form X* + AY?” (as defined above in §XII) or, what 
amounts to the same, — A is a quadratic residue modulo p. 
As to the converse, assume that p (other than 2 if A = | or 
+2, and other than 3 if A = 3) is a divisor of the form 
X? + AY*; this means that p divides some integer a® + Ab’, 
with a and b prime to p; as neither one of the integers 
a + bV—Aisa multiple of p in R, this implies that p cannot 
be a prime in R. 
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The problem of the number of representations of an 
integer N by one of the forms X? + AY? can now be better 
formulated as the problem of the number of elements of 
R with the norm N when two associates are not counted as 
distinct; in modern terms, this amounts to determining the 
zeta-function of the field K = Q(V—A). As the various 
cases under consideration are quite similar, it will be enough 
to examine the case A = 1, R = Z[t]. We have to solve 
N = aa’. If N has a prime divisor p of the form 4n — I, 
then, since p remains a prime in R, it must divide @ or a’ 
and must therefore divide both, so that N is a multiple of 
p’; this gives p-*N = (a/p)(a'/p). This reduces the problem 
to the case when N has no such prime divisor. Then we can 
write 


N = 2°ph... pe 


where each p; is of the form 4n + | and can therefore be 
written as 7,7; , where 77; is a prime in R. For any 2, let A,, 
pt; be the exponents of 7;, 7; n a; they are also the exponents 
of 77, 7;in a’, and we have ); + yw; = 7;. Up toa unit factor 
2”, a must then be given by 
= (14+ at ay. alle. 

Clearly the number of possible choices for @ is II(7;+ 1). 
This must be halved for the number of representations of 
N by X* + Y®, since a and a’ give the same representation. 
The representation is “proper” in the sense of §1X if and 
only if A; = 0 or »; = 0 for all 2, since otherwise a@ would 
be a multiple of p;; thus the number of proper representations 
is 2"~'. All this agrees of course with Fermat’s results de- 
scribed in §IX. 

We can now make use of the ring Z[V2] in order to re- 
interpret and clarify Fermat’s communication of 1657 to 
Frenicle (JEH.41—44; cf. above, §XII). Let 7 = a + bV2 
be a normalized integer, with 0 < a < 6; put p = ew” = 
A + Bv2; this gives 


A =a’ + 20° — 4ab, B =a’ + 2b’ — 2ab, 
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and it is easily seen that also p is normalized. Now assume 
that p is a rational prime of the form p = 2x” — 1, and that 
p = 2y? — 1. Puta = 1,b = x,so that p = —77'; then z, 
a’ are primes in R. Putw = 1 + yV2,so thatww’ = —p? = 
~—a'n'®; as w, 7’ are primes, and as @ is not a multiple of 
p, this implies that @ is an associate of 7° or of 7”, or, what 
amounts to the same, of p or of —p’. But @, p, —p’ are 
normalized; therefore » must be either p or —p’, and we 
get 1 = +(14+ 2x?— 4x), just as in §XII, and x = 1 or 2. If 
p were not a prime, this proof could clearly not be carried 
out, despite Fermat’s assertion to the contrary (cf. §XII). 


Appendix IT 


Curves of Genus | in Projective Spaces 


As we have seen, most of the diophantine problems studied 
by Fermat concern equations or sets of equations which, as 
we would say now, define elliptic curves, i.e., algebraic curves 
of genus 1; the problem in each case is to find rational (or 
sometimes integral) points on the curve. For a proper un- 
derstanding of his methods it is convenient to translate them 
into the language now usual in algebraic geometry. 

Much of what we have to say here, and indeed much of 
Fermat’s work on this topic (cf. above, §XV) retains its validity 
over arbitrary ground-fields, including even those of char- 
acteristic p > 1; in order to simplify the language, we shall 
disregard the latter. So far as Fermat is concerned, only 
the field R of real numbers (the field of “geometry” as 
understood in his time), the field Q of rational numbers, 
and implicitly the field of all algebraic numbers, have any 
relevance. 

Let I’ be an algebraic curve; we assume that it is irreducible 
and defined by equations with coefficients in a subfield K 
of the field C of complex numbers. By a point of T’ we will 
understand one with coordinates in C; it will be called rational 
if its coordinates are in K, algebraic if they are so over K. 
A divisor means a formal sum a = *%a;A,, where the A; are 
points of I and the a; are integers; 2a; is the degree of a. 
If all the a; are = 0, a is said to be positive, and we write 
a>. If P; is an algebraic point of T, and P;, Po, ..., P,, are 
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all its distinct conjugates over K, the divisor =P; is said to 
be prime rational (over K); a divisor is called rational if it 
is a linear combination, with integral coefficients, of prime 
rational divisors. By a rational function f(M) on I, one un- 
derstands a rational function of the coordinates of the point 
M on T;; it is called rational over K if its coefficients lie in 
K. One defines poles and zeros of functions on I, with their 
multiplicities, in the usual manner; every function has as 
many zeros as it has poles, each being counted as many 
times as its multiplicity indicates. If the zeros of f are A;, 
Ag, ..., An, and its poles are By, By, ..., B,, then ZA; — XB; 
is called the divisor of f and is written div(/); it is of degree 
0; if fis rational over K, so is div(f). If a divisor a is the 
divisor of some function f, it is said to be equivalent to 0, 
and one writes a ~ 0. Equivalences between divisors can 
then be written in the usual manner. 

The curve I is said to be of genus | (or “elliptic”) if, to 
every divisor a of degree 1 on I’, there is one and only one 
point A ~ a; we shall henceforth assume that this is so. 
Then, if m is a rational divisor of degree m > 0 on I, the 
functions f on I’, rational over K, such that div(f) > —m 
(and in particular the functions with the poles M,, ... M,, if 
mt = M,) make up a vector-space of dimension m over K. 
In particular, if m = 1, the function f is unique up to a 
constant factor, and its divisor is div(f) = N — m, where 
N is the point such that N ~ m; this point N, which is thus 
obtained as a zero of f, must then be rational. 

Consequently, if there is on Ia rational divisor of degree 
1, there must also be on I at least one rational point A. In 
that case, the functions f on I’, rational over K, such that 
div(f) > —2A (resp. > —3A) make up a vector-space V 
(resp. W) of dimension 2 (resp. 3) over K. Take any non- 
constant x © V; then V consists of the functions ax + B on 
I’, with a, B in K; similarly, if we take y in W — V, W will 
consist of the functions Ay + wx + v with dA, w, vin K. By 
suitable choices of x in V and of y in W, one may assume 
that they satisfy an equation of the form 


y =x — ax —b 
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with a and 6 in K; here the right-hand side has no multiple 
root, since otherwise this would not define an elliptic curve. 
Then one can identify [ with the plane cubic defined by 
this equation, or rather with the cubic defined in the pro- 
jective plane P? by the homogeneous equation 
¥°Z = X° — aXZ? — 62°, 

A being then the point at infinity (0, 1, 0) on the cubic. If 
M,N, P are three points on the cubic, they lie on a straight 
line if and only if M + N + P ~ 3A; in particular, if, for 
every P = (x, y) on the curve, we write P’ = (x, —y), we 
have P + P' ~ 2A. If, for every pair of points (M, N) on 
the curve, we define a point M +N by the relation 
M+N~M+N — A, the law (M,N) > M + N defines 
the set of points on I as a commutative group of which the 
rational points make up a subgroup, A being the neutral 
element; when dealing with a given cubic, one may write 
M + N instead of M + N and 0 instead of A. We have 
P=M +Nifand only if M, N, P’ are on a straight line, 
i.e., if P’ is the third point of intersection with the cubic of 
the straight line through M and N. Similarly, M + M, for 
which one may write simply 2M, is the point P such that P’ 
is the intersection of the cubic with the tangent to the cubic 
at M. 

Assume now merely that [has a rational divisor a = 
A + A’ of degree 2; we can again take a non-constant x on 
I, rational over K, such that a + div(x) > 0. The space W 
of functions f, rational over K, such that 2a + div(f) > 0 is 
of dimension 4 and contains a subspace W’ generated by 
1, x, x°; take y in W and not in W’. After replacing y by 
y + Ax® + wx + pv with suitable values of A, wu, v, we have 
an equation 


y = Apx* + a,x® + Qyx* + A3x + a4 


where the right-hand side has no multiple roots; this may 
be regarded as defining the curve I. In an obvious sense, 
the two points A, A’ which make up the divisor a are the 
points at infinity of the curve; they are rational if and only 
if ao is a square in K. 
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Similarly, if P has a rational divisor a of degree m 2 3, 
take a basis (x, ..., X») of the space of functions f, rational 
over K, such that a + div(f) > 0; map I into the projective 
space P”~' of dimension m — | by taking, as the image of 
each point M of F, the point of P”~! with the homogeneous 
coordinates 


(x1(M), «2.5 Xm(M)). 


The image of [ in P”~' is then a curve 2 of degree m in 
p”~*. Its intersection with the hyperplane 


a,X1 tose + OinXm = 0 


consists of the points P, ..., P,, given by 
ain(S aa) = Ss P, —- a 
1 1 


and m points P,, ..., P,, of T lie on a hyperplane if and only 
if 27 P; ~ a. If m = 3, we thus get for F a model which is 
a plane cubic; the model Q is a space quartic if m = 4. In 
the latter case, consider the 10 functions x;x;, for 
1 <1 <j < 4; they are in the space of functions f satisfying 
2a + div(f) > 0; as this space is of dimension 8, we have 
two linear relations between the x,x;; this amounts to saying 
that © is the intersection of two quadrics P(X) = 0, 
V(X) = 0, where ®, V are two quadratic forms in the X;, X;. 

Finally, assume that we have on I’ two non-equivalent 
rational divisors a = A + A’,b = B + B’ of degree 2; take 
non-constant functions x, y, rational over K, such that 
a + div(x) > 0, b + div(y) > 0. Consider the 9 functions 
x'y with i, 7 < 2; they are in the space of functions f such 
that 2a + 26 + div(f) > 0; as this is of dimension 8, there 
must be a linear relation between the x’'y’, i.e., a relation 
F(x, y) = 0, where F is a polynomial of degree <2 in x and 
<2 in y. Here y must occur in F, since otherwise x would be 
a constant; similarly x must occur in it. If F were of degree 
1 in y, y would be of the form R(x), R being a rational function. 
But it is easily seen that the divisor of poles of y would then 
be linearly equivalent to a multiple ma of a. If m > 1, this 
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would contradict the assumption 6 + div(y) > 0; it would 
also contradict that assumption if m = I, since 5 is not 
equivalent to a. Consequently we have a relation 


F(x, y) = 0 


with F of degree 2 in x and of degree 2 in y. Diophantine 
equations of that type were first considered by Euler (cf. 
infra, Appendix V, and Chap.III, §XVI). 


Appendix III 


Fermat’s “Double Equations” as Space Quartics 


In our discussion of the “double equations” studied by 
Fermat and Euler and of the infinite descent as practised 
by them, we shall need some elementary facts about space 
quartics of genus 1. Let 0 be such a curve; as we have seen 
in Appendix II, it can be defined by a pair of equations 
® = W = 0, where ® = 0 and V = 0 are the equations of 
two quadrics in P*. For any é, call Q, the quadric given by 
® — EW = 0; these quadrics make up the pencil of quadrics 
through Q (including of course the quadric V = 0, which 
may be written Q..). 

For any quadratic form 

4 
b= Dd) ajXiX; 
ij=l 
we shall write det(®) = det(a;). A change of homogeneous 
coordinates with the determinant 6 multiplies det(®) with 
5°; if ® is multiplied with a constant A, det(®) is multiplied 
with \*; therefore det(®) is well defined up to a square 
factor when the quadric ® = 0 is given. For ®, V as above 
we shall write F(€) = det(®—- EW). For F(é) = 0, Q; isa 
cone; if Q is a space quartic this cone cannot degenerate 
into a pair of planes. Clearly the “pencil” {Q,} must contain 
at least one cone; one may assume this to be Qy; then 0 is 
a root of F; by taking coordinates so that ® = X} — 2X\Xo, 
it is easily seen that 0 cannot be a double root of F unless 
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the quadric VY = 0 goes through the vertex (0, 0, 0, 1) of 
the cone ® = 0, in which case 2) is of genus 0 and not 1. 
Thus F is of degree 4 or 3 and has four distinct zeros on 
the projective straight line; in other words, the pencil of 
quadrics through 2 contains exactly four cones. 

Now we will sketch a proof for the following result: 


Proposition. There is a rational mapping w from Q X Q.to the 
curve F(é) = 1° such that: (1) if M, M', N, N' are four points 
on Q, then w(M, M’') = oN, N’) if and only if 
M+M'~N +N’; (u) of Ais a rational point on Q, then the 
mapping M — (A, M) is an isomorphism of © onto the curve 
F(é) = 9°. 

For convenience we begin by recalling some well-known 
facts: 


Lemma. A quadric ® = 0 (not a pair of planes) carries one 
family of straight lines or two according as det(®) 1s 0 or not; if 
it contains a rational straight line, then det(®) 1s a square. 


Ifdet(@) = 0, the quadric is a cone and the above assertions 
are obvious. If det(®) # 0, then, after suitably extending 
the groundfield, one can assume that the quadric contains 
a rational point A; take coordinates so that A = (0, 0, 0, 1) 
and that the tangent plane at A is X; = 0; after possibly 
another rational change of coordinates, one can write, up 
to a constant factor: 


® = X? — dX} - 2X3X,; 


here det(®) = d # 0. Then A lies on the two straight lines 
X, = +X.Vd, X, = 0.In particular, if we had assumed that 
the quadric contains a rational straight line, we could have 
chosen for A a rational point on it and proceeded as above; 
then Vd must be rational, and det(®) must be a square. In 
any case, the two straight lines through A belong respectively 
to the two families 


(X,+Xe\/d) = uX3, — u(X) FXo\/d) = 2tXy. 


Clearly each point of the quadric lies on one and only one 
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line in each one of these families; this implies that all straight 
lines lying on the quadric belong to one or the other of 
these families. 

Now, going back to the quartic 0 defined by ® = V = 0, 
take two points M, M’ on Q; call Ava the straight line through 
M and M' (or the tangent to 0 at M, if M = M’). If the 
quadric ® — €V = 0 contains one point of Ajj beside M 
and M’, it must contain Ay, ; thus ®/V has a constant value 
on Ava, for which we write &(M, M’); clearly this is a rational 
function of the coordinates of M and M’, with coefficients 
in the groundfield K. In view of the lemma, F[&(M, M’')] 
must be a square after the adjunction of the coordinates of 
M and M’ to the groundfield; thus there is a rational function 
74(M, M’) of those coordinates, such that 


F{é(M, M’)] = 9(M, M'y* 
Write ' for the curve F(é) = 7° and » for the mapping 
(M, M') > (€(M, M’), n(M, M')) 


of O x OQ into T; we will show that this has the properties 
described above. We begin with (ii). 

As in Appendix II, take a groundfield K such that ® and 
W have their coefficients in it; assume that there is on 0 a 
point A with coordinates in K. For any &, take 7 such that 
7° = Fé). Let A be one of the straight lines through A on 
Q,; as shown in the proof of the lemma, it is rational over 
the field K(&, n). So is therefore the point P, other than A, 
where A meets some quadric other than Q,, say the quadric 
® = 0, in the pencil {Q,}. As P is then on 2, we have A = 
Aap, &(A, P) = &, and therefore n(A, P) = +7; replacing 
1 by —7 if necessary, we may assume that 7(A, P) = 7, so 
that w(A, P) = (€, n). Since at the same time P is rational 
over K(€, n), this proves that P > w(A, P) is indeed a birational 
mapping, and therefore an isomorphism, between Q and 
I. In particular, this shows that w(A, P) = w(A, P’) implies 
P = P’,a conclusion which remains valid even if A is not 
rational, since we can always enlarge K so as to make it such. 

Now take M, M’, P on ©; put w(M, M’) = (&, n). Call P’ 
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the fourth point of intersection of Q with the plane I through 
M,M',P; puta = M+ M' + P + P’. As Aya and App: lie 
in IT, they intersect at a point R. The quadric Q, contains 
Aum’, hence R, hence App; therefore we have &(P, P’) = €. 

Let N, N’ be two more points on 0; we have N + N' ~ 
M + M' if and only if N + N’ + P + P' ~ a; as has been 
shown in Appendix II, this is so if and only if N, N’, P, P’ 
he in a plane; if so, the above proof shows that we have 
E(N, N') = &(P, P’) = &, and therefore n(N, N’) = +7. Now, 
M and M' being kept fixed, take N variable; as n(N, N’) 
is given rationally in terms of N, N’, and as 7 is constant, 
7(N, N’) must be independent of N. Taking N = M, we 
get N’ = M’. Therefore 7(N, N') = n, and a(N, N’) = 
w(M, M’), as asserted. 

Conversely, assume that w(N, N’) = w(M, M’), and take 
N" such that N + N” ~ M + M’'. Then we have 


w(N, N") = w(M, M’) = oN, N’). 


As proved above, this implies N” = N’, completing the proof 
of our proposition. Our result can be interpreted by saying 
that the equivalence classes of divisors of degree 2 on 0 are 
parametrized by L. 

Our proposition can be applied for instance to the curves 
defined, in the language of Diophantus and of Fermat, by 
a “double equation” 


ax? + Qbxy + cy = 27, a’x®? + QW'xy + cy? = #7. 
Since they consider only curves of that form which have in 
projective space at least one “visible” rational point, we may, 


for our present purposes, assume that this is so. Then we 
see that the curve is isomorphic to the one given by 


Y? = X[(b'X—b)” — (a’X—a\(c'X—0)]. 
Take for instance the problem 
x? + 4y? = 27, vt+y=t 


for which Euler proved (cf. above, §X VI) that there are no 
solutions other than the obvious ones. We see that these 
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equations define a curve isomorphic to 
y* = —X(X—1)(X~4). 


On the other hand, take the problem, considered by Fermat 
(cf. §XVI) of finding four squares in arithmetic progression. 
This amounts to 


r= e tr wa yPt 
By the same rule as before, this curve is isomorphic to 
Y? = X(X+2)(2X+1) 


which is changed into the previous equation by substituting 
— 4X, aY for X, Y. Thus, as Euler found out, the two problems 
are equivalent. 


Appendix IV 


The Descent and Mordell’s Theorem 


Our purpose is now to provide some background for 
Fermat’s “method of descent” in his treatment of elliptic 
curves (cf. above, §XV1I) in the light of modern work on 
the subject’. 

As has been shown in Appendix II, if an elliptic curve 
has at least one rational point, it can be exhibited as a cubic 
y® = f(x), where fis a polynomial of degree 3. The rational 
points on the cubic make up a group, and Fermat’s method 
of descent has been refined by Mordell in 1922 so as to 
show that this group is “finitely generated”; this means that 
there are on the cubic finitely many rational points such 
that all others can be derived from these by the group op- 
eration defined in Appendix Il. The same is true if the 
groundfield, instead of being the field Q of rational numbers, 
is any algebraic number-field of finite degree over Q. Let 
r be the largest integer such that there are on the cubic r 
rational points which do not satisfy any linear relation with 
integral coefficients not all zero; then r is called the rank 
of the curve. Even now no systematic method is known for 
determining the rank of a given cubic, let alone a set of 
generators for its group of rational points. 


"Cf. e.g. J. W. S. Cassels, J. London Math. Soc. 41 (1966), pp. 193— 
291, and A. Weil, Coll. Papers 1.47-57. 
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For simplicity we consider the case when the roots of f 
are rational integers a, 8, y, the cubic I being given by 


(1) y? = fix) = a(x Bx). 


Put x = X/Z, where Z is a natural integer and X an integer 
prime to Z. This gives 


(Z*y)? = Z(X—aZ)(X — BZ)(X — yZ). 


As Z is prime to the other factors in the right-hand side it 
must be a square, Z = T*. Traditionally one observes that 
the g.c.d. of any two of those factors divides a fixed integer, 
so that one may write X — aZ = AU®, where A is susceptible 
of only a finite number of values, therefore x — a@ = Aw 
with u = U/T, and similarly x — B = Bu’,x — y = Cu’, 
where u, v, w are rational and B, C can take only finitely 
many values. If one uses only the first one of these relations, 
one gets a rational point (u, z=y/u) of the quartic 


2 = A(Aw+a-B)(Auv?+a-y), 


which is one of finitely many thus attached to I’. On the 
other hand, after properly choosing the signs of u, v, w, 
one can write 


(2) Aw+a=Bv+B=Cw+y 
(3) x = Au? + a, y = VABC - uvw. 


Here ABC has to be a square; (2) defines a space quartic 1 
in the (u, v, w)-space, and (3) defines a mapping of Q into 
I’. The original problem is thus reduced to finding rational 
points on finitely many such quartics. Either one of these 
methods is typical of the traditional descent. 

An alternative procedure, equivalent to the one described 
above but more consonant with the modern theory would 
be as follows. Let (x, y) be a solution of (1); we can write, 
in one and only one way, x ~ a@ = Au*®,x — B = Bv’, 
x — y = Cw”, with rational u, v, w and squarefree inte- 
gers A, B, C (positive or negative); equations (2), (3) are satis- 
fied, and ABC has to be a square. 
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For any squarefree integers A, B, C (positive or negative) 
such that ABC is a square, call 0(A, B, C) the space quartic 
defined by (2). The first step in the descent thus consists in 
reducing (1) to the finding of rational points on infinitely 
many quartics Q(A, B, C). The next step will be to reject all 
but finitely many of these because they can have no rational 
points. 

In homogeneous coordinates, Q(A, B, C) may be regarded 
as defined by equations 


(4) AU? + aT? = BV? + BT? = CW? $+ yT?: 


Q(A, B, C) has a rational point if and only if (4) has a solution 
in integers U, V, W, T without a common divisor; a necessary 
condition for this is obviously that the same relations, re- 
garded as congruences modulo some integer m, should have 
such a solution, whatever m may be. In the language of 
p-adic fields, this can also be expressed by saying that (2) 
should have a non-trivial solution in every p-adic field Q,,; 
to this one can add the condition that it should have a non- 
trivial solution in the field R = Q,. of real numbers. 

In particular, take for p a common prime divisor of B 
and C; as ABC is a square, and A, B, C are squarefree, p 
does not divide A. Take m = p’, and suppose that p does 
not divide 8B — y; then the congruences 


AU? + aT? = BV? + BT? = CW® + yT° (mod p”) 


imply firstly that p divides 7, then that it divides U, then 
that it divides V and W; 2(A, B, C) can then have no rational 
point (not even in Q,). Thus, if Q(A, B, C) has rational 
points, and if we write a for the g.c.d. of B and C, a must 
divide B — y. Write now B = ab, C = ac, with b prime to 
c; then we must have A = bc. As B and C are squarefree, a 
is prime to b and ¢, so that |b] is the g.c.d. of A and C, and 
|c| is the g.c.d. of A and B; therefore b divides y — a, and 
c divides a — B. 

This leaves us with finitely many curves Q(A, B, C), out 
of which further arguments and the consideration of the 
real field may lead to reject some more; it is even possible 
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to discard all except those which have points in all fields Q, 
and in R. In contrast, however, with classical results, going 
back to Legendre (cf. Chap.IV, §VI), on curves of genus 
0, not all such quartics have rational points, nor is there 
any known procedure for finding one if such points exist. 
In order to proceed further, it will now be assumed that, 
among all quartics ()(A, B, C), the quartics Q; = QO(A,, B;, C,) 
(1 <7 <n) and no others have at least one rational point. 
Then, if M = (x, y) is any rational point on I, there is one 
and only one 7 such that (2) and (3) are satisfied for A = 
A;,B = B,, C = C;and for some rational values for u, v, w; 
when that is so, we will say that M belongs to (2, and that it 
corresponds to the point N = (u, v, w) on Q). 

Let OQ = QO(A, B, C) be one of the 0,; instead of defining 
it by (2), we write 


®(U, V, W, T)= a(B—y)(AU? +aT?) + Bly—a)(BV? + BT’) 
+ ya-B)(CW*+yT*) 
a(B—y)AU* + B(y-a@)BV* 
+ y(a- B)CW? — 8T? 
W(U, V, W,T) = (B-y)AU? + (y—a)BV? + (a-B)CW? 
where we have put 

8 = (B~-y)(y— @)(@- B); 


then Q may be regarded as defined by ® = V = 0. With 
the notations of Appendix III, we have 


FE) = de(@— SV) = BABE a) — BYE — y) 
C - f(g). 


= 8°AB 


For any pair of points N, N’ on O, let &(N, N’), n(N, N’) be 
defined as in Appendix III; put 
wo(N, N') = (E(N, N'), (8VABC)~'n(N, N’)), 


so that w) maps 0 x QO into. 
Let M = (x, y) be a point on T, belonging to Q and cor- 
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responding to a point N = (u, v, w) of Q, so that the relations 
(2), (3) are satisfied. The tangent Ayy to O at N goes through 
the point with the homogeneous coordinates (u', v’, w’, 0) 
given by Auu’ = Buu' = Cww’. Therefore &(N, N) is given 
by 


_ Ou’, v', uw’, 0) 
g(N, N) = W(u',v’, w’, 0) 


An easy calculation gives €(N, N) = x; then we have 
n(N, N)” = F(x) = 8°ABCf(x) = 5°ABCy’. 


As 9, according to its definition in Appendix II], is defined 
only up to a factor +1, we may, after changing its sign 
if necessary, assume that n(N, N) = dVABC - 4, so that 
w (N,N) = M. If we write \ for the mapping N > wo(N, N), 
this shows that A is no other than the mapping of Q into 
defined by (3). 

Now choose a rational point My on I’, belonging to OQ and 
corresponding to a point No on Q; by definition, this means 
that (No) = Mo. Write « for the mapping N — a (No, N) 
of © into Y; the proposition in Appendix III shows that 
this is an isomorphism of 2 onto [; write p | for its inverse; 
we have (No) = A(No) = Mo. Take any rational point M 
onT; put P = yw '(M), so that M = u(P). By definition, M 
belongs to © and corresponds to a point N on © if and only 
if (3) maps N onto M, 1.e. if and only if AN) = M = p(P), 
i.e., in view of the definition of A and yw, if and only if 
a(N, N) = wo(No, P). Again by the proposition in Appendix 
Ill, this is so if and only if 2N ~ No + Pon. Put M’ = 
u(N); as # is an isomorphism and maps N, No, P onto M’, 
Mo, M respectively, the relation 2N ~ Ng + P on Qis equiv- 
alent to 2M’ ~ M, + M. This proves that M belongs to 0 
if and only if there is a rational point M’ on [ such that 
2M’ ~ Mo + M, in which case M corresponds to the point 
uw '(M’) on Q. 

For the proof of Mordell’s theorem, one also needs a 
quantitative estimate for the height of M’ in terms of the 
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height of M, these being defined as follows. Put again M = 
(x, y) and x = X/T?, with X prime to T; then we define the 
height A(M) of M by h(M) = sup(|X|, T*); similarly, if a point 
N of 1 has the homogeneous coordinates (U, V, W, T), these 
being rational integers without a common divisor, we put 


h(N) = sup(U], |V|, |W1, |7)). 


If now the point M of T corresponds to the point N on OQ, 
we have 


X = AU? + aT? = BV? + BT? = CW? + yT?, 


so that U*, V?, W? are bounded in terms of |X| and T®, i-e., 
in terms of h(N). This gives: 


A(N) < cV/hA(M), 


where c depends solely upon I’. For an estimate of h(M’) in 
terms of A(N), one could apply Siegel’s theorem on heights 
(cf. C. L. Siegel, Ges.Abh.1.249-250, or A. Weil, Coll. Papers 
1.477); this would give 


A(M’) < c'h(NY? 


where one can take for p any number >3, and where c’ 
depends solely upon Q, My and p. For our present purpose, 
however, a cruder estimate will serve just as well. As before, 
we have M’ = a)(No, N), and therefore x’ = &(No, N) if 
M' = (x’, y’). Put No = (Uo, Vo, Wo, To); by definition, 
x’ = &(Ny, N) means that ® — x’V is 0 at some point 
(Up t+tU, Vott¥, Wot+tW, To +tT) with t # 0. As No and N 
satisfy P = W = 0, this gives 


_ PU, Vo, Wo, To; U, V, W, T) 


(5) x’ = &(No, N) = W(Uo, Vo. Wo, To; U, V, W, T) 


where numerator and denominator are the bilinear forms 
respectively associated with the quadratic forms ®, WV; these 
are linear forms in U, V, W, T, with coefficients which depend 
solely upon 0 and M,. Of course, for any given M, the 
values of these forms need not be prime to each other; 
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nevertheless, if we write x’ = X'/T" with X’ prime to T’, 
those values give upper bounds for |X’| and T’?. Thus we 
get: 


h(M') < c,h(N) < cVh(M) 


where ¢,, and therefore c, = cc,, depend solely upon F, 0 
and Mp. 

Now we come back to the finitely many quartics 0); defined 
above; we assume that on each we have somehow chosen a 
rational point N;; let M; be the corresponding point on I. 
For every rational point M, there must then be one and 
only one of the points M; such that M + M,; is equivalent 
to a divisor 2M’, M' being a rational point of [. Here M’ 
is not uniquely defined; it is so up to an element of order 
2 in the group of rational points on the cubic [ (in the sense 
explained in Appendix I); those elements are the point at 
infinity and the three points (a, 0), (8, 0), (y, 0). 

The process of the descent is then as follows. Starting 
with a rational point M of I, we can write it, in the sense 
of the group of such points on [, as M ~ 2M’ — M;, where 
M’ is again rational; we have h(M') < d;Vh(M), with d; de- 
pending solely upon I’, O; and M;; we can now repeat the 
process by writing M’ ~ 2M” -— M, where we have 
h(M") < d;Vh(M’), etc. Thus we get a sequence of points M, 
M’,M", etc. whose heights must ultimately become less than 
m if we take m > sup(d?). Obviously there are only finitely 
many points P,, Po, ..., P,on T whose height does not exceed 
m. Thus the points M;, P; generate the group of rational 
points on T; this is Mordell’s theorem. In principle, the 
same proof can be used for a cubic over any groundfield 
of finite degree over Q. 

For obvious reasons the above descent is called one “by 
bisection” (or by division by 2); division by any other integer 
would, in principle, be equally effective. While division, by 
2 or by n, turns out to be most suitable for a general treatment 
of the problem, it is well-known from the classical theory 
of elliptic functions that division by n, which amounts to an 
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extension of degree n’ of the field of functions on the curve, 
may always be replaced by two “transformations” (now usu- 
ally called isogenies) of order n. For instance, if one starts 
from the cubic I defined by (1), division by 2, leading to 
the quartic Q(A, B, C), can be obtained by the successive 
adjunction of u = V(x—a)/A and of v = V(x—B)/B, each 
of these amounting to a transformation of order 2. Thus 
it is not surprising that the formulas used by Fermat, Euler 
and Lagrange in the descent define such transformations 
(except in the treatment of the curve x° + y* = 1, which 
depends upon transformations of order 3; cf. above, §X VI). 
As has been pointed out in §XVII, it may have been no 
mere coincidence that Lagrange re-discovered the “Lan- 
den transformation” some seven years after his major paper 
on equations of genus I. 

We will now illustrate our discussion of equation (1) by 
treating two examples, both originating from Fermat, which 
have been mentioned in §XVI. Take first the problem of 
the four squares in arithmetic progression; as Euler noted, 
and as has been shown in Appendix III, it is equivalent to 
the equation 


(6) y? = x(xt+ 1)(x+4) 


which will be treated in the manner explained above. The 
quartics 0; are to be sought among the curves Q(A, B, C) 
defined by equations 


x = Au? = Bu* — 1 = Cw’ — 4; 
here we must have A = bc, B = ca, C = ab, with a, b, ¢ 
dividing respectively 3, 4 and 1; thus A is +1 or +2, B is 
+1 or +3,and C = AB. For the curve to have real points, 
C must be >0. Consider the congruences 
AU? = BV? ~ T? =CW? —- 4T? (mod 3°); 


if (A, B, C) is (2, 1, 2), (1, 3, 3), (-1, — 1, 1) or (~2, —3, 6), 
they have no solution in integers not all divisible by 3; 
therefore these values should be rejected. This leaves the 
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four curves Qo, 1, Qe, O13 respectively given by 

(A, B,C) = 1, 1, 1), (2, 3, 6), (—1, ~3, 3), (— 2, — 1, 2) 
with the four points of T 

My = ~, M, = (2,6), Mz, = (~1,0), Ms = (-2, -2) 


belonging respectively to the quartics 0,. If we put Py = 
(0, 0), P; = (-4, 0), then the elements of order 2 in the 
group of rational points on I are Mo, Po, P), which, together 
with the neutral element My = ©, make up a group of order 
4; it is easily seen that 2M, = 2M, = Po, M,; + P, = Ms, 
so that + M,, + Ms; are elements of order 4. 

Consider the real points on T; they make up two “branches”, 
one By consisting of the points for which x 2 0 
and extending to infinity, and an oval B, consisting of the 
points for which -4 < x < —1; any straight line in the 
plane must cut B, at two points or not cut it at all; in view 
of the definition of the addition operation on I’, this shows 
that By is a subgroup of index 2 of the group of real points, 
with B,; = By + My; every rational point on By must belong 
either to D, or to D,. If such a point M belongs to Qy (resp. 
to 0,), we can write M = 2M’ (resp. M = 2M’ — M,); here, 
if M’ lies on B,, we can replace it by M' — My; thus, in 
carrying out the descent, it is enough to consider the rational 
points on Bo. 

All that remains to be done, in order to carry out the 
descent in the manner explained above, is to obtain the 
estimate for h(M’) in terms of h(M) if M = 2M’ or M = 
2M’ — M,. Suppose first that M belongs to Qo; with 
the same notations as before, let M = (x, y) correspond to 
N = (U,V, W, T) on Q); here My corresponds to No = 
(1, 1, 1, 0). Writing x = X/T? as before, and h = h(M) = 
sup(X, T*), we have 


X = U? = V? — T? = W? — 4T*, 
and therefore 
JUl<Vh, |Vi<V2h, |Wi<V5h, U = W (mod 2). 
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As to M’, it is given by (5); writing M’ = (x',y'),x’ = X'/T”, 
we can write, in view of the fact that U = W (mod 2): 
xX’ aVv-W) 
T”? 4(3U-4V+W)’ 


where X', T’? are majorized by the numerator and denom- 
inator in the right-hand side. Putting h’ = h(M’), we get 
now: 


h’ < 20/2 + V/5)Vh. 


Similarly, for M belonging to 0, and for M, = (2, 6), N, = 
(1, 1, 1, 1), we get h’ < 4Vh. This shows that the group of 
rational points on the branch By of T is generated by M, 
and points whose height is <54. As to the latter, let M = 
(x, y) be such a point, with x = X/T*; as x > 0, and as M 
belongs to MQ» or 0), we have X = U? or X = 2U*; X and 
T? are <54. It is easy now to verify that there are no such 
points other than M, and M,. Inconclusion, the only rational 
points on [ are the eight points My, M,, Py, P,, + M,, + Ms. 
If now we write the equations for four squares in arithmetic 
progression: 
Qy? = x? + 27, Qr = y+ t, 

we see that in the projective space P® they have no more 
than the solutions given by x* = y* = z” = 2°; this is what 
Father de Billy had been told by Fermat. Our other example 
will be treated in Appendix V. 


Appendix V 


The Equation y* = x° — 2x 


The methods sketched above in Appendices III and IV 
will now be applied to Fermat’s problems (A) and (B) (cf. 
above, §§XV and XVI). 

As we have seen, the question amounts to the solution of 
the “double equation” (11) of §XV, Le., of 


(7) e+ lev x - 2 -lav 


in rational numbers, with the side-condition x > I in case 
(A) and x < —1 in case (B); (7) defines a space quartic 0 
which, in homogeneous coordinates, may be regarded as 
given by ® = VY = Oif one puts 


(8) ® = X2-2XxT-T?-V?, W=U2-xX?-T?, 


To this we can apply the results of Appendix III; with the 
notations of that Appendix, we have F(é) = €° — 2, and 
{) is isomorphic to the cubic T’ given by 


(9) n= &(&~ 2). 


We shall proceed with this as in Appendix IV, with the 
modifications required because the roots of the right-hand 
side are not all rational. 

As in the case of (6) in Appendix IV, the group I, of 
the real points of I consists of two branches By, B,, defined 
respectively by € > 0 and by € < 0; Bo is a subgroup of I. 
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of index 2, and we can write B, = By + My, with My = 
(0, 0); Ms is an element of [., of order 2. As in the case of 
(6), Appendix IV, it will be enough to carry out the descent 
for the rational points on By. 

Put € = P/Q, where P, Q are mutually prime positive 
integers; we have 

(Q’n)” = PQ(P*— 2Q") 

and infer that Q must be a square, Q = V*. The g.c.d. of 
P and of the last factor on the right is 2 or 1 according as 
P is even or odd. If P is even, we must have P = 2U’, 
P? — 2Q? = 28°, 2U* — V* = 8°; if P is odd, we can write 
P = U®, P? — 2Q? = S*,U* ~ 2V* = S*. Thus, incidentally, 
the problem can be reduced to the consideration of the 
equations 2u* — 1 = +5°; this is how Lagrange had pro- 
ceeded in his paper of 1777 (Lag.1V.377—398), where he 
begins by putting S = X* + 2XT — T° and observing that 
the equations ® = YW = 0 give 2U* — V* = S*; the cor- 
respondence thus defined between 2) (or [) and the quartic 
curve 2u* — 1 = s° is a transformation (an “isogeny”) of 
order 2, and the relationship between (2) (or T) and the 
curve 2u* — 1 = —s° is similar. 

Here, in analogy with Appendix IV, we shall use the 
bisection and the ring Z{V2], for which notations will be as 
in Appendix I. If P is odd, we put 


a=P + QV2 =U? + V2; 
then a’ is prime to a, and we have 
aa’ = N(a) = P? — 2Q? = S$”. 


If P is even, we put 
a= Qt Svea Vi + Ue; 


as Q is prime to P, it is odd; therefore a’ is again prime to 
a, and we have 


2 
aa’ = N(a) = Q? - 2(6) = —S?, 
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In both cases, writing a as a product of primes and of a 
unit in the ring Z[V2], one finds that a, up to a unit, must 
be a square, so that it is of the form +f’ or + ef? according 
as aa’ is S* or —S*; as before, we pute = 1 + V2; Bis an 
integer in Z[V2]. In view of the above formulas we must 
have a > 0, so that @ is 8” in one case, ef” in the other. Put 
B = Y — TV2 and substitute 6° resp. 68? for a in the above 
formulas; one gets 


U? = Y? + 2T?, V? = -2YT 
in one case, and 
U? = Y? — 2YT + 2T?, V? = ¥? — 4YT + 2T° 


in the other. These pairs of equations define two space 
quartics Qo, 0, in the projective space with the homogeneous 
coordinates (Y, T, U, V); in conformity with the notations 
of Appendix III, we put 


®, = 4YT + 2V7, Wy = ¥? + OT? — VU’, 
®, = Y* — 4YT + 2T? — V?, WV, = U? — (Y?-2YT+4+2T"), 


so that Qo (resp. 0) is defined by Bp = Wy = O (resp. 
®, = WV, = 0); one may note that the substitution 
Y = X + T changes ®,, V, into ®, V as defined in (8), so 
that Q, is projectively the same as 0. Again with the notations 
of Appendix III, F(é) is 4&(€? — 2) for Qo, &(€?- 2) for 1); 
the points M, = © and M, = (2, 2) of F belong respectively 
to OQ» and to 0; My corresponds to No = (1, 0, 1, 0) on Oo, 
and M,toN, = (1,9, t, l)onQ,. Forz = Oor I, a rational 
point M = (€, n) of T belongs to Q, if and only if there is a 
rational point M’ = (€', y’) on such that M ~ 2M' — M,,. 
If M corresponds to N = (Y, T, U, V) on Q,, then we have 
é = U*/V? if i = 0, € = 2U?/V? if i = 1. Putting again 
h(M) = sup(P, Q) and A(N) = sup(¥1, |T|, |U], |V]), we have 
in both cases A(N) < Vh(M). As before, M’ is given by €’ = 
&(N;, N), where &(N, N’) is defined as in Appendix III for 
any pair of points N, N’ on Q;. Forz = 0, this gives, in view 
of (5), Appendix IV, and of the fact that Y - U must be 
even: 
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an T 
B= (No N) = ap 


and therefore h(M') = A(N) = VA(M). Fori = 1, Y, U and 
V must be odd, and T must be on we get 


re _ «Y¥-2T-V) 
and h(M') < 2h(N) < 2VA(M). Consequently the sequence 
M, M’', M", ... produced by the infinite descent on the 


branch Bo of r must lead to a point of height <4. Since the 
only such point on Bo is M,, this shows that M, generates 
the group of rational points on By; in other words, those 
points are all given by 


(10) A, ~ Mo + n(M,-—Mo) 
for some n € Z, while those on B, are given by 
(11) A, ~ Mz + n(M,-— Mb); 


we have Ay = Mo, A; = Mi, Ao = (9/4, ~— 21/8), etc. If A, = 
(é, ), then, as easily seen, we have A_, = (€, —y) and 

—2/€, 2n/€?). As shown above, we have h(A,) < 
Vh(Az,), so that the heights of the points A, are not bounded; 
the cubic T is “of rank 1”, and there is no equivalence relation 
between the points My, M,, Mg, other than the obvious one 
2M, ~ 2Mp. 

Now we turn back to Fermat’s original problems and to 
the curve © defined by (7). Let N = (X, T, U, V) be a point 
of 0; take a rational point P on Q, e.g. P = (1, 0, I, 1); if 
we use it as in Appendix III in order to obtain an iso- 
morphism between 0 and I’, and write M = (€, y) for the 
point of T corresponding to N, then we have € = &(P, N); 
in view of (5), Appendix IV, this gives, after taking (8) into 
account: 


V-X+T . X+U 
X-U VtX-T 


From this one sees that the points of I corresponding to 
No = (1, 0, 1, —1) and to Ng = (1, 0, —1, 1) on Q are 


(12) f= 
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respectively My) and Mg. As to the point corresponding to 
P = (1, 0, I, 1), it must be either (2, 2) or (2, — 2), so that, 
after choosing properly the sign of y in the correspondence 
between 2 and I, we may assume that it is M, = (2, 2); 
accordingly we write N, for P. Therefore all the rational 
points on 22 can be written in terms of No, Ni, No by formulas 
similar to (10) and (11); €, regarded as a function on Q, has 
the divisor 2N2 ~ 2No. Now consider the planes T = 0, 
X+U=0,V +X —T = 0; if we put N; = (1,0, -1, -1), 
they determine on © the divisors 


3 
div(T) = S'N,, — div(X+U) = 2Nq + 2Ns, 
i=0 


div(V+X—T) = 2Np + 2Nzg. 
Therefore the functions defined on © by 
_X+U  ,_V#X-T 
— TT? T 
have divisors respectively given by 

div(¢) = No + N3 _ No _ N;, 

div(6) = No + Nz _ N, _ No. 
If N = (x, 1, u, v) isa point on Q, the function ¢ — (x+u), 
which has the two poles No, N,, must have two zeros; obviously 
they are N and the point N’ = (x, 1, u, —v), so that we have, 
when that is so, N’ ~ No + N, — N. Similarly, using the 
function 6 — (v+x—1), we see, for N” = (x, 1, —u, v), that 
we have N” ~ N,; + No — N; for N” = (x, 1, —u, —v), this 
gives 


(13) 


N’ ~N, — Ny +N. 


In particular, if we call B, = (x,, 1, u», v,) the point of 0 
corresponding to the point A, on I’, this shows that the 
points corresponding to A,_, and to A; are respectively 


By_, = (xn, I, Un,» —Vn), B,, = (Xn, 1, Un, — Un). 


Thus these are all the rational points on 2. 
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We will now take this opportunity for illustrating a method 
introduced by Euler towards the end of his life for solving 
“by ascent” (in the sense given above to this word in §XV) 
diophantine equations of genus |; this was based on what 
he called “canonical equations” (Eu.1-5.158 in E 778|1780; 
cf. infra, Chap.III, §XVI1), i.e. equations F(X, Y) = 0 of 
degree 2 in X and 2 in Y; for slightly different applications 
of his method to a problem equivalent to the present one 
(given by the pair of equations X + Y = Z°, X* + Y? = 
T*), cf. Eu.I-5.77-81 = E 769|1780 and Eu.I-5.87~88 in 
E 772|1780. 

As explained above in Appendix II, a “canonical equation” 
in the sense of Euler will hold between two functions on 
the given curve () if each of them has only two poles. As 
such functions we choose é and €, as defined by (12) and 
(13). Putting as before x = X/T, u = U/T, v = V/T, we 
have x + u = &, and, in view of the relation x? — u? = —1, 
x—u= —1/¢; this gives 


1) yh 
sed eed 


At the same time formula (12) gives: 


ort l= €e-w=-hute- t= fatw =F 


g 


and therefore: 


¢_é cl .é 


v=2-S, —~]J=2+—>. 
E 2 E 2 

Comparing the values found above for x, we get a relation 

between € and ¢ which can be written as a “canonical 

equation”: 


F(E, 0) = &? — & — 2& — 20° - € = 0 


Moreover, the above formulas determine a birational cor- 
respondence, i.e. an isomorphism, between () (or, what 
amounts to the same, I’) and the plane curve C defined by 
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F = Q; the points on C corresponding to No, Ni, Ne, Ns are 
respectively 
Po = (%, %), P, = (2,%), Py = (0,0), Ps = (—1, 0). 


Euler’s “ascent”, applied to the equation F = 0, is now as 
follows. Let Ro = (&o, >) be any rational point on C; as F 
is of degree 2 in @, the equation F(€, z) = 0 has, beside fo, 
a second rational root ¢,, given by 


2&0 ~G= _ €ol€ot VD), 
&-2 ” bo(Eo— 2)’ 


this determines on C a point Sp = (€o, ¢1); the divisor of 
the function € — &, is then Ry + Sp — 2Po, so that we have 


So ~ 2P,5 — Ro. 


a= 


Similarly, the equation F(x, ¢;) = 0 has a second solution 
é;, given by 


2 
é=G-% -1-& = 2H 


and determining on C a point R,; = (€;, ¢,) for which we 
have 


Ry, ~ Po + Pi — So. 


Repeating the same procedure, we get a sequence Ro, So, 
R,, S,, etc., and an easy induction gives: 


R,~ Ro + n(P,—Po), S, ~ 2P9 — R,. 


If for instance we take as our starting point the point Ry = 
Po, the comparison with the formulas obtained above shows 
that the points R,, 5, are those which correspond on C to 
the points A,, A_, on IT’, respectively. We have, in that case, 
Ro = So = Po, Ri = P,, S; = (2, —3/2), and then, using 
the above formulas: 


9 3 9 39 
Ry = (2. -2), So= (3 a 
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39 1469 
R3 = (338, *) Ss = (338, - 2), 


p, = (12769 _ 1469 _ (12769 _ 172825) 
+“ \7056’ 84 7 °4~ \7056’ 112812) 


Then the point B, = (x,, 1, u,, v,) of O is the one cor- 
responding to R,, by the above formulas, and the points 
(x,, 1, u,, +v,), for n = 0, 1, 2, ..., are all the rational 
points on (. In particular, we have (apart from xp = 
x; = %, which correspond respectively to the points No, Ny 


on Q)): 


eek yp a il _ 2150905 
2 9? 3 5B? 4 046799” 
_ _ 16969358281 
*5 * ~ 38880655800 


Fermat’s problem (A) required a solution with x > 1; 
problem (B) required x < — 1; his solutions, which are indeed 
the smallest possible ones, correspond respectively to x3 and 
tO X4. 


Euler in old age. 


Chapter Three 
Euler 


SI. 


Until the latter part of the seventeenth century, mathe- 
matics had sometimes bestowed high reputation upon its 
adepts but had seldom provided them with the means to 
social advancement and honorable employment. VIETE 
had made his living as a lawyer, FERMAT as a magistrate; 
even in Fermat’s days, endowed chairs for mathematics were 
few and far between. In Italy, the University of Bologna 
(“lo studio di Bologna”, as it was commonly called), famous 
throughout Europe, had indeed counted Scipione del 
FERRO among its professors in the early sixteenth century; 
but CARDANO had been active as a physician; BOMBELLI 
was an engineer, and so was Simon STEVIN in the Neth- 
erlands. NAPIER, the inventor of logarithms, was a Scottish 
laird, living in his castle of Merchiston after coming back 
from the travels of his early youth. Neighboring disciplines 
did not fare better. COPERNICUS was an ecclesiastical dig- 
nitary. Kepler’s teacher MAESTLIN had been a professor 
in Tubingen, but KEPLER plied his trade as an astrologer 
and maker of horoscopes. GALILEO’s genius, coupled with 
his domineering personality, earned him, first a professor- 
ship in Padova, then an enviable position as a protégé of 
the Grand-Duke of Tuscany, which saved him from the 
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worst consequences of his disastrous conflict with the Church 
of Rome; his pupil TORRICELLI succeeded him as “phi- 
losopher and mathematician” to the Grand-Duke, while 
CAVALIERI combined the Bologna chair with the priorate 
of the Gesuati convent in the same city. 

Among Fermat’s scientific correspondents, few held pro- 
fessorial rank. ROBERVAL, at the Collége de France (then 
styled Collége Royal), occupied the chair founded in 1572 
from a legacy of the scientist and philosopher Pierre de la 
Ramée. The Savilian chair at Oxford, created for H. BRIGGS 
in 1620, was held by WALLIS from 1649 until his death in 
1703; but his talented younger friend and collaborator Wil- 
liam BROUNCKER, second Viscount, was a nobleman whose 
career as commissioner of the Navy, and whose amours, 
are abundantly documented in Pepys’s diary. It was only 
in 1663 that Isaac BARROW became the first Lucasian pro- 
fessor in Cambridge, a position which he relinquished to 
NEWTON in 1669 to become preacher to Charles II and 
achieve high reputation as a divine. In the Netherlands, 
while Descartes’s friend and commentator F. SCHOOTEN 
was a professor in Leiden, René de SLUSE, a mathematician 
in high esteem among his contemporaries and an attractive 
personality, was a canon in Liége. DESCARTES, as he tells 
us, felt himself, by the grace of God (“graces a4 Dieu”: Dascours 
de la Méthode, Desc.V1.9), above the need of gainful em- 
ployment; so were his friends Constantin HUYGENS and 
Constantin’s son, the great Christian HUYGENS. LEIBNIZ 
was in the employ of the Hanoverian court; all his life he 
preserved his love for mathematics, but his friends marveled 
sometimes that his occupations left him enough leisure to 
cultivate them. 

Whatever their position, the attitude of such men towards 
mathematics was often what we can describe as a thoroughly 
professional one. Whether through the printed word or 
through their correspondence, they took pains to give proper 
diffusion to their ideas and results and to keep abreast of 
contemporary progress; for this they relied largely upon a 
private network of informants. When they traveled, they 
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looked up foreign scientists. At home they were visited by 
scientifically inclined travelers, busy bees intent on dissem- 
inating the pollen picked up here and there. They eagerly 
sought correspondents with interests similar to their own; 
letters passed from hand to hand until they had reached 
whoever might feel concerned. A private library of reason- 
able size was almost a necessity. Booksellers had standing 
orders to supply customers with the latest publications within 
each one’s chosen field. This system, or lack of system, worked 
fairly well; indeed it subsists down to the present day, sup- 
plementing more formalized modes of communication, and 
its value is undiminished. Nevertheless, even the seventeenth 
century must have found it increasingly inadequate. 

By the time of Euler’s birth in 1707, a radical change had 
taken place; its first signs had become apparent even before 
Fermat’s death. The Journal des Scavans was started in January 
1665, just in time to carry an obituary on Fermat (“ce grand 
homme”, as he is called there). Louis X1V’s far-sighted minister 
Colbert had attracted Huygens to Paris in 1666, and the 
astronomer CASSINI in 1669, awarding to each a royal 
pension of the kind hitherto reserved to literati. In 1635 
Richelieu had founded the Académie frangotse; the more 
practical-minded Colbert, realizing the value of scientific 
research (pure no less than applied) for the prosperity of 
the realm, set up the Académie des sciences in 1666 around a 
nucleus consisting largely of Fermat’s former correspon- 
dents; Fermat’s great friend and former colleague Carcavi 
was entrusted with its administration and became its first 
secretary. In England, some degree of political stability had 
been restored in 1660 by the recall of Charles I; in 1662 
the group of amateurs (“virtuost”) who had for some time 
held regular meetings in Gresham College received the 
charter which made of them the Royal Society, with Brouncker 
as its first president; in 1665 they started the publication 
of the Philosophical Transactions, which has been continued 
down to the present day. In 1698 the French academy fol- 
lowed suit with a series of yearly volumes, variously entitled 
Aistoire and Mémoires de Académie des Sciences. In 1682 Leibniz 
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was instrumental in creating, not yet an academy, but at 
least a major scientific journal, the Acta Eruditorum of Leipzig, 
to whose early issues he contributed the articles by which 
he was giving birth to the infinitesimal calculus. 

Soon universities and academies were competing for sci- 
entific talent and sparing neither effort nor expense in order 
to attract it. Jacob BERNOULLI had become a professor 
in his native city of Basel in 1687; as long as he lived, this 
left little prospect to his younger brother and bitter rival 
Johann of finding academic employment there; at first he 
had to teach Leibniz’s infinitesimal calculus to a French 
nobleman, the Marquis de VHOPITAL, even agreeing to 
a remarkable contract whereby the latter acquired an option 
upon all of BERNOULLI’s mathematical discoveries. In 
1695, however, Joh. Bernoulli became a professor in Gro- 
ningen, eventually improving his position there by skilfully 
playing Utrecht against Groningen; finally he settled down 
in Basel in 1705 after his brother’s death. No wonder, then, 
that in 1741 we find him congratulating Euler on the financial 
aspects of his Berlin appointment and suggesting at the 
same time that he would be willing (for a moderate yearly 
stipend, “pro modico subsidio annuo”: Corr.11.62) to enrich the 
memoirs of the Berlin Academy with regular contributions 
of his own. In short, scientific life, by the turn of the century, 
had acquired a structure not too different from what we 
witness to-day. 


SII. 


Euler’s father, Paul EULER, was a parish priest established 
in Riehen near Basel; he had studied theology at the uni- 
versity of Basel, while at the same time attending the lectures 
of Jacob Bernoulli; he had planned a similar career for his 
son, but placed no obstacle in the way of young Leonhard’s 
inclinations when they became manifest. Clearly, by that 
time, a bright future was in store for any young man with 
exceptional scientific talent. 

In 1707, when EULER was born, Jacob Bernoulli was 
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dead, and Johann had succeeded him; Johann’s two sons 
Nicolas (born in 1695) and Daniel (born in 1700) were fol- 
lowing the family tradition, except that, in contrast to their 
father and uncle, they loved each other dearly, as they took 
pains to make known (cf. e.g. Corr.I1.291). Euler became 
their close friend and Johann’s favorite disciple; in his old 
age, he liked to recall how he had visited his teacher every 
Saturday and laid before him the difficulties he had en- 
countered during the week, and how hard he had worked 
so as not to bother him with unnecessary questions. 

Three monarchs came to play a decisive role in Euler’s 
career: Peter the Great, Frederic the Great, and the Great 
Catherine. Peter, a truly great czar perhaps, died in 1725; 
but he had had time to found Saint Petersburg, erect some 
of its most impressive buildings, and, most important of all 
for our story, to make plans for an Academy of Sciences 
modelled on what he had seen in the West; those plans 
were faithfully carried out by his widow. In 1725 the two 
younger BERNOULLIS, Nicolas and Daniel, were called 
there. Nicolas died the next year, apparently of appendicitis. 
About the same time an offer went to Euler to join the 
Petersburg Academy. He was not quite twenty years old; 
he had just won a prize for an essay on ship-building, never 
having seen a sea-going ship in his life. He had no early 
prospects at home. He accepted with alacrity. 

From Basel he sailed down the Rhine to Mainz, then 
traveled to Liibeck, mostly on foot, visiting Christian Wolff 
on the way; this was a philosopher and follower of Leibniz, 
banished from Berlin (as he told Euler) by a king with little 
understanding for philosophy; his hobby-horse was Leibniz’s 
theory of monads, and Euler was clearly not impressed. 
From Ltbeck a ship took the young mathematician to 
Petersburg. 

In those days academies were well-endowed research in- 
stitutions, provided with ample funds and good libraries 
(cf. Corr.1.36). Their members enjoyed considerable free- 
dom; their primary duty was to contribute substantially to 
the academy’s publications and keep high its prestige in the 
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international scientific world. At the same time they were 
the scientific advisers to the monarch and to state authorities, 
always on hand for such tasks, congenial or not, as the latter 
might find fit to assign to them; had it not been so, no state 
would have undergone the high expense of maintaining 
such institutions, as Euler once acknowledged to Catherine 
(cf. Eu.IV A-1, no. 1887/1766). In 1758, at the height of 
his fame, Euler (who had acquired in Petersburg a good 
command of the Russian language) did not find it beneath 
him, nor inconsistent with his continuing close relations 
with the Petersburg Academy, to translate for king Frederic 
some dispatches seized during military operations against 
the Russian army. 

In 1727, however, the political situation had changed by 
the time Euler reached Petersburg. Under a new czar all 
academic appointments were in abeyance. On the strength 
of his prize essay, Euler was commissioned into the Russian 
navy, but not for long. Soon he was a salaried member of 
the academy, at first with the junior rank of “adjunct”. When 
his friend Daniel left for Basel in 1733, he was appointed 
in his place; thus he could afford to marry, naturally into 
the local Swiss colony, and to buy for himself a comfortable 
house (cf. his letter of 1734 to his colleague G. F. Muller: 
Eu.IV A-1, no. 1683). His bride was the daughter of the 
painter GSELL,; in due course she was to give birth to thirteen 
children, out of whom only three sons survived Euler; little 
is recorded of her otherwise. The eldest son Johann Albert, 
born in 1734, was to become one of his father’s collaborators, 
and later a leading member of the academy. 

Once Euler was thus well established in Petersburg, his 
productivity exceeded all expectations, in spite of the com- 
parative isolation in which he had been left by Daniel Ber- 
noulli’s departure. It was hardly interrupted by a severe 
illness in 1735 and the subsequent loss of his right eye. He 
had beyond doubt become the most valuable member of 
the Academy, and his reputation had been growing by leaps 
and bounds, when two events in the higher spheres of Eu- 
ropean politics brought about a major change in his peaceful 
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life. In Petersburg the death of the czarina in 1740, a regency, 
and the ensuing turmoils, seemed to threaten the very ex- 
istence of the Academy. Just at this juncture Frederic II 
succeeded his father (the same king who had so cavalierly 
thrown Chr. Wolff out of Berlin) on the throne of Prussia; 
he immediately took steps directed towards the establishment. 
of an academy under his patronage, for which he sought 
out the most famous names in European science; naturally 
Euler was on the list. A munificent offer from Frederic, 
coupled with a fast deteriorating situation in Petersburg, 
brought Euler to Berlin in July 1741, after a sea-voyage of 
three weeks on the Baltic during which he alone among his 
family (or so he claimed) had been free from sea-sickness 
(cf. his letter to Goldbach of 1 August 1741: L. Euler and 
Chr. Goldbach, Briefwechsel, edd. A. P. Juskevié und E. Win- 
ter, Berlin 1965, p. 83). In the following year, to his great 
satisfaction, he was able to purchase an excellent house, 
well situated, and, by special royal order, exempt from re- 
guisition (¢bid.p.130|15 Dec. 1742). There he lived for the 
next 24 years, with apparently the sole interruption of 
seasonal visits to the country estate he acquired in 1752 
in Charlottenburg (cf. Eu.IV A-1, no.2782), and of a family 
trip to Frankfurt in 1750 to meet his widowed mother 
who was coming from Basel to live in Berlin with him; 
his father, who had been disappointed in his hope of 
getting Euler’s visit in Basel (cf. Corr.1I.57,451) had died 
in 1745. 

With Euler’s change of residence one might have expected 
that the steady flow of his publications would be diverted 
from Petersburg to Berlin; but far from it! He was not 
only allowed to keep his membership in the Petersburg 
Academy, but his pension from Petersburg was continued, 
and he was intent upon giving his former colleagues value 
for their money (cf. e.g. Corr.1.200|1743). Well might his 
great-grandson P.-H. Fuss describe Euler’s Berlin period 
as “twenty-five years of prodigious activity”. More than 
100 memoirs sent to Petersburg, 127 published in Berlin 
on all possible topics in pure and applied mathematics 


Euler in middle age. 
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were the products of those years, side by side with major 
treatises on analysis, but also on artillery, ship-building, 
lunar theory; not to mention the prize-winning essays sent 
to the Paris academy (whose prizes brought substantial 
cash rewards in addition to high reputation; cf. 
Corr.1.497|1749); to which one has to add the Letters to a 
German Princess (one of the most successful popular books 
on science ever written: Eu.JII-11 and II]-12 = E 343, 
344|1760-1761) and even a defense of christianity (Rettung 
der gittlichen Offenbarung ... , Eu I1-12.267-286 = E 
92|1747) which did nothing to ingratiate its author with 
the would-be philosopher-king Frederic. At the same time 
Euler was conducting an increasingly heavy correspondence, 
scientific, personal and also official since the administrative 
burdens of the academy tended to fall more and more 
upon his shoulders. 

As years went by, Euler and Frederic became disen- 
chanted with each other. The king was not unaware of 
the lustre that Euler was throwing upon his academy, 
but French literati stood far higher in his favor. He was 
seeking to attract d ALEMBERT to Berlin and was expected 
to put him above Euler as head of the Academy (cf. 
Corr.1.667,668,670,672|1763). Euler was spared this blow 
to his self-esteem; d’Alembert, forewarned perhaps by 
Voltaire’s unpleasant experience with Frederic in 1753, 
enjoyed basking in the king’s favor for the time of a short 
visit but valued his freedom far too highly to alienate it 
more durably. Nevertheless, as early as 1763, Euler’s 
thoughts started turning again towards Russia. 

Fortunately another political upheaval had just taken place 
there. In 1762 the czar’s German wife had seized power as 
Catherine II after ridding herself and Russia of her husband. 
One of her first projects was to restore the Petersburg acad- 
emy to its former glory. This was almost synonymous with 
bringing Euler back. Negotiations dragged on for three 
years. Finally, in 1766, the Russian ambassador in Berlin 
was instructed to request Euler to write his own contract. 
Frederic, realizing too late the magnitude of this loss, had 
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tried to put obstacles in the way; he soon found that he 
could not afford to displease the imperial lady. In the same 
year Euler was back in Petersburg, after a triumphal journey 
through Poland where Catherine’s former lover, king Stan- 
islas, treated him almost like a fellow-sovereign. 

By then Euler was losing his eyesight. He had lost the 
use of his right eye during his first stay in Petersburg. About 
the time when he left Berlin, or shortly thereafter, a cataract 
developed in his left eye. In 1770, in answer toa letter from 
Lagrange on number theory, he described his condition as 
follows: “Je me suis fait lire toutes les opérations que vous avez 
faites sur la formule 101 = pp — 134q et je suis entiérement 
convaincu de leur solidité; mais, étant hors d@’état de lire ou d’écrire 
moi-méme, je dois vous avouer que mon imagination n’a pas été 
capable de saisir le fondement de toutes les déductions que vous 
avez été obligé de faire et encore moins de fixer dans mon esprit la 
signification de toutes les lettres que vous y avez introduites. II est 
bien vrai que de semblables recherches ont fait autrefors mes délices 
et m’ont cotité bien du tems; mais a présent je ne saurois plus 
entreprendre que celles que je suis capable de dévellopper dans ma 
téte et souvent je suis obligé de recourir 4 un ami pour exécuter les 
calculs que mon imagination projette” [“I have had all your 
calculations read to me, concerning the equation 101 = 
p? — 13q?, and I am fully persuaded of their validity; but, 
as I am unable to read or write, I must confess that my 
imagination could not follow the reasons for all the steps 
you have had to take, nor keep in mind the meaning of all 
your symbols. It is true that such investigations have formerly 
been a delight to me and that I have spent much time on 
them; but now I can only undertake what I can carry out 
in my head, and often I have to depend upon some friend 
to do the calculations which I have planned”: Eu.IV 
A-5.477/9 March 1770). 

An operation was attempted in 1771 and was successful 
at first, but the eye soon became infected, and total or near- 
total blindness ensued. Except for this misfortune, and for 
a fire which destroyed his house in 1771 among many others 
in Petersburg, he lived on in comfort, greatly honored and 
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respected; neither old age nor blindness could induce him 
to take a well-deserved rest. He had assistants, one of whom 
was his own son; others were sent to him from Basel, with 
the co-operation of his old friend Daniel Bernoulli; to one 
of them, N. FUSS, who had come to Petersburg in 1773 
and later married a granddaughter of Euler’s, we owe a 
vivid description of Euler’s method of work in the last decade 
of his life (Corr. I.xliv-xlv). Hundreds of memoirs were written 
during that period; enough, as Euler had predicted, to fill 
up the academy publications for many years to come. He 
died suddenly on 18 September 1783, having preserved 
excellent general health and his full mental powers until 
that very day. 


SII. 


No mathematician ever attained such a position of un- 
disputed leadership in al] branches of mathematics, pure 
and applied, as Euler did for the best part of the eighteenth 
century. In 1745 his old teacher Johann Bernoulli, not a 
modest man as a rule, addressed him as “mathematicorum 
princeps” (Corr.11.88,92). “Ce diable homme” is how d’Alembert 
qualified him in comic disgust, writing to Lagrange in 1769, 
on finding himself anticipated by Euler in some results which 
he had felt rather proud of (Lag. XII1.136). 

Here our concern will be with his work on number theory, 
which alone would have earned him a distinguished place 
in the history of mathematics, had he never done anything 
else; but it occupies only four volumes out of the more than 
seventy of his complete works. He seems to have worked 
on it chiefly by fits and starts. He was perhaps ill prepared 
for it by his apprenticeship under Johann Bernoulli, but 
took it up early in his career, and it remained a favorite 
subject with him until the end. This topic, as he well knew, 
was no more popular with his contemporaries than it had 
been in the seventeenth century (cf. Chap.II, §XVH; cf. 
also Daniel Bernoulli’s disparaging remarks upon the subject 
of prime numbers, quoted below in §X). Euler, however, 
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never tired of telling his readers that he felt no need to 
apologize for the time and efforts thus spent, that truth is 
one, and that no aspect of it may be neglected without damage 
to the whole (cf. e.g. Eu. I-2.62-—64,295,428,461—462,519- 
522, etc.). 

A substantial part of Euler’s arithmetical work consisted 
in no more, and no less, than getting proofs for Fermat’s 
statements as contained in his published works at the time 
(cf. infra, 81V). Our tentative reconstruction of Fermat’s 
original proofs, given above in Chapter II, was largely based 
on the plausible but unprovable assumption that they could 
not have differed much from those later obtained by Euler. 
Thus these proofs need not be described again in detail; it 
will be enough to indicate the chronology of their discovery 
by Euler. Apart from the dates of presentation of his memoirs 
to the Petersburg academy, sometimes differing by several 
years from their dates of publication, one main source for 
this is his correspondence, chiefly the one with Goldbach, 
which was published by Euler’s great-grandson P.-H. Fuss 
in 1843; it makes up the first volume of the Correspondance 
Mathématique et Physique (Corr.1); the second volume contains 
the letters from the Bernoullis to Euler, Goldbach and N. 
Fuss; Goldbach’s conjecture on sums of primes occurs there 
in Goldbach’s letter of 7 June 1742 (Corr.1.127). A fuller, 
abundantly annotated publication of the same correspon- 
dence is due to A. P. JuSkevi¢ and E. Winter (cf. Bibliography 
at the end of the present volume). Further correspondence 
between Euler and the Bernoullis is contained in Euler’s 
Opera Postuma of 1862 (vol.I, pp. 519-588) and in G. Enes- 
trém’s articles in Bibl.Math. from 1903 to 1907 (cf. Bibli- 
ography). Euler’s correspondence with Clairaut, d’Alembert 
and Lagrange is published in full in Eu.IV A-5; further 
correspondence with various contemporaries (in the original 
text and in Russian translation) makes up the volume Pis’ma 
k Uéénym, Moskva-Leningrad 1963 (PkU.); Eu.IV A-1 is a 
complete repertory, with brief summaries, of the whole of 
Euler’s immense correspondence. So far as number theory 
is concerned, the most valuable information, at least for the 
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years from 1730 to 1756, is to be found in his letters to 
Goldbach. 

GOLDBACH had been born in Konigsberg in 1690; a 
man of versatile tastes and talents, obviously well educated 
and well-to-do, he traveled extensively in his younger days 
(cf. Corr.II.183—184), seeking out learned men and scientists 
everywhere. In 1721 he met Nicolas Bernoulli in Venice; 
soon after that he started a correspondence with him, and 
then also with his younger brother Daniel. In 1725 he was 
largely instrumental in bringing them both to Petersburg, 
where he preceded them by a few months (cf. Corr. 11.169); 
he seems to have been regarded by both brothers, and very 
soon also by Euler, as an influential friend and patron; 
Euler treated him all his life with a touching mixture of 
respect, esteem and affection. Perhaps he had started on 
his trip to Russia out of mere curiosity; but he stayed there 
until his death in 1764, at times in Petersburg and at times 
in Moscow. His chief hobbies were languages and mathe- 
matics, more particularly number theory, differential cal- 
culus and the theory of series; this made him an invaluable 
correspondent for Euler. 

Goldbach left Petersburg for Moscow early in 1728; his 
correspondence with Euler starts in 1729; at first it is con- 
ducted entirely in Latin and continues so, less actively of 
course, even after Goldbach’s return to Petersburg in 1732, 
at a time when they must have seen each other almost daily. 
Suddenly, on 21 August 1740, Euler drops into German 
with an urgent request for help (“Die Geographie ist mir fatal”: 
Corr.1.102); having already lost one eye, as he thinks, to 
cartographical work, he asks to be dispensed from a similar 
assignment which is not only alien to his regular duties but 
threatens to be fatal to his eyesight. Goldbach, justly alarmed, 
answers him on the very same day, also in German. In the 
following year Euler leaves for Berlin, and the correspon- 
dence continues steadily, basically in German, until close 
to the end of Goldbach’s life. Their German, however, is 
not only sprinkled with French words, but it tends to drop 
into Latin, especially in the mathematical passages. Here is 
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a specimen, taken from one of Euler’s first letters after his 
arrival in Berlin (Corr.I.105—107|9 Sept.1741); after reporting 
on the gracious reception granted to him by the king and 
the royal family and on various other matters, Euler goes 
on to describe his latest ideas about number theory and 
“eulerian” integrals: “Von den divisoribus quantitatis aa + mbb, 
sia et b sint numeri inter se prim, habe ich auch curieuse proprietates 
entdeckt, welche etwas in recessu zu haben scheinen .. .” (“about 
the divisors of a? + mb’, if a and b are mutually prime, I 
have discovered curious properties, where some secrets seem 
hidden . . .”]. The “curious properties” are then described 
in Latin (cf. infra, §§Vi and VIII). 


SIV. 


Number theory makes its appearance in Goldbach’s reply 
to Euler’s very first letter. On 13 October 1729, Euler, on 
Daniel Bernoulli’s advice, had sent to Goldbach some early 
results on what, since Legendre, has been known as the 
gamma function (Corr.I.3—7); obviously he had been reading 
Wallis’s Arithmetica Infinitorum (Wal.1.355—478). Goldbach’s 
answer, on 1 December, ts a cordial one, and competent 
enough, without bringing anything new; but it carries a 
fateful postscript: “Is Fermat’s observation known to you, 
that all numbers 2°” + 1 are primes? He said he could not 
prove it; nor has anyone else done so to my knowledge” 
(Corr.1.10). 

Actually Goldbach had never read Fermat (as he told 
Euler a little later: Corr.I.26) and was quoting him from 
hearsay; the same may be said of other references to Fermat 
in his correspondence (e.g. Corr.11.168,238]1725). In those 
days Fermat’s assertions seem to have circulated among 
“number-lovers” asa kind of folklore; Euler calls them “well- 
publicized” [in vulgus notas”: Eu.1-2.38 in E 98|1738]. 

At first Euler responded somewhat coolly to Goldbach’s 
question, by a mere expression of skepticism (Corr.1.18). 
But Goldbach insists (Corr.1.20), and suddenly, in June 1730, 
Euler catches fire. Not only is he now taking the matter 
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seriously, but he has taken up Fermat’s works and has started 
reading them (Corr.1.24). 

One may ask which of Fermat’s writings were accessible 
to him at the time. These included the Varia Opera of 1679 
and the Commercium Epistolicum of 1658 (containing some 
important letters by Fermat) which Wallis had inserted into 
his monumental Algebra of 1693; Euler quotes both in 1732 
(Eu.1-2.2 in E 26). Did he also have access to Fermat’s ob- 
servations in Samuel de Fermat’s Diophantus of 1670 (cf. 
Chap.II, 81)? He makes no mention of them until 1748, 
but then quotes them thrice; writing to Goldbach (Corr.1.445— 
446|1748), he mentions for the first time Fermat’s famous 
observation (Fe.1.291, Obs.II) about “Fermat’s equation” 
x” + y” = 2”; then in his next letter (Corr.1.455|1748) he 
copies out in full the observation about sums of four squares, 
etc., and Fermat’s intention to write a whole book on the 
subject (Fe.1.305, Obs.X VIII); in the same year he devotes 
a paper (Eu.1-2.223-240 = E 167|1748) to a diophantine 
problem stated by Fermat in the Diophantus (Fe.1.333, 
Obs.XL). Does it not look as if Euler had just discovered 
Fermat’s Diophantus at that time? 

At any rate, when Euler took up Fermat’s writings in 
1730, he found a statement there (apparently the one in 
Fermat’s letter to Digby, no.XLVII of the Commercium Ep- 
istolicum; Fe.11.403) which impressed him even more than 
the conjecture about 2°” + 1, just as it had impressed Huygens 
when it first appeared (cf. Chap.II, §XVII). As he tells 
Goldbach, Fermat had said that every number is a sum of 
four squares (“non inelegans theorema”, Euler calls it, somewhat 
inadequately: Corr.1.24) and had further statements about 
sums of triangular and pentagonal numbers, of cubes, etc., 
“whose proofs would greatly enrich analysis”. He had dis- 
covered a topic which was to haunt him all his life. 

Soon another letter from Goldbach sends him back to 
Fermat and to Wallis’s Algebra. Fermat had stated that a 
triangular number 3n(n+1) cannot be a fourth power 
(Fe.1.341, 11.406; cf. above, Chap.II, §X). Goldbach thought 
he had proved, in the Acta Eruditorum of 1724, that such a 
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number cannot even be a square, and he had communicated 
his “proof” to Nicolas and to Daniel Bernoulli in 1725 
(Corr.11.170,237,239) and to Euler in 1730 (Corr.1.34). Euler 
pointed out the error at once; if one puts x = 2n + 1, the 
question amounts to x” — 8y* = | and is thus a special case 
of “Pell’s equation”. “Such problems”, he writes (Corr.1.37), 
“have been agitated between Wallis and Fermat” (a clear 
reference to the Commercium) “. ..and the Englishman Pell 
devised for them a peculiar method described in Wallis’s 
works”. Pell’s name occurs frequently in Wallis’s Algebra, 
but never in connection with the equation x” — Ny* = 1 to 
which his name, because of Euler’s mistaken attribution, 
has remained attached; since its traditional designation as 
“Pell’s equation” is unambiguous and convenient, we will 
go on using it, even though it is historically wrong. Obviously 
the method referred to by Euler is the one Wallis credits 
to Brouncker (Wal.11.797; cf. Wal. 11.418—429, and above, 
Chap.II, §XIII). 

Euler must have been a careless reader in those days. 
Even so, another minor puzzle is hard to explain. Not only 
does Euler express the view that Fermat had not proved 
his assertion about sums of four squares (“neque demonstra- 
tionem ejus habuisse videtur”: Corr.1.35|1730; cf. Eu.1-2.33 in 
E 54/1736), but at first he seems convinced that Fermat 
never said that he had proved it (“neque ipse Fermatius de- 
monstrare se posse affirmat”: Corr.1.30|1730). In Fermat's writ- 
ings as published at the time, there are only two passages 
containing that statement. One, and presumably the one 
which Euler had noticed, is in Fermat’s letter to Digby, 
no.XLVII of the Commercium Epistolicum (Fe.11.403 = 
no.XLV1I in Wal.11.857—858) quoted by Euler in 1732 along 
with Fermat’s Varia Opera of 1679 as a reference for the 
conjecture about 2°" + 1 (Eu.I-2.2 in E 26|1732); the other 
is in Fermat’s Diophantus (Fe.1.305, Obs.XVIIJ); on both 
occasions Fermat declares with great emphasis that he pos- 
sesses complete proofs. In 1742, writing to CLAIRAUT, 
Euler not only describes the former passage correctly but 
expresses the belief that Fermat had indeed a proof, and 
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deplores the loss of his manuscripts: “Ce seroit un grand 
avantage ... si lon publioit ces démonstrations, peut étre que les 
papers de ce grand homme se trouvent encore quelque part” [“It 
would be an excellent thing if these proofs were published; 
maybe the manuscripts of this great man can still be found”: 
EuAV A-5.124]. When did Euler change his mind? 

No less surprising is the story of his re-discovery of Fermat's 
theorem a’~' = 1 (mod p), p being a prime and a prime to 
p. Euler was concerned with it as early as 1731. Later in 
life, coming back to it in the course of a systematic inves- 
tigation of the multiplicative group modulo p, he calls it 
“theorema eximium, a Fermatio quondam prolatum” [“an out- 
standing theorem once brought forward by Fermat”: Fu. I- 
2.510 in E 262|1755]. In his early days, however, we see 
him taking up the problem of the supposedly prime numbers 
2°" + 1 precisely as Fermat had done almost a century 
before, obviously in complete ignorance of what was readily 
available to him on pages 163, 177 of Fermat’s Varia Opera 
(Fe.11.209,198). He begins by observing that 2’"'—1 is a 
multiple of p when # is a prime, calling this “theorema non 
inelegans” on sending it to Goldbach in November 1731 
(Corr.1.60); then, also experimentally (“tentando”: Eu.1-2.1 
in E 26/1732), he finds that a?~' — b?~' is a multiple of p 
when a and 6 are prime to #, discovering at the same time 
the counterexample 2°° + 1 to Fermat's conjecture (cf. 
Chap.II, §1V). By June 1735, Euler has become vaguely 
aware at least of Fermat’s letter of 1640 to Mersenne 
(Fe. II. 195—199) when he writes to Ehler in Dantzig: “Theorema 

. non est novum, sed ni fallor a Fermatio assertum, at sine 
demonstratione, ex sola inductione” (“The theorem (2?7' = 1 
mod p) is not new; if I am not mistaken it was stated by 
Fermat, but without proof, as being obtained merely by 
induction”: PkU.295]; this, in the face of Fermat’s reiterated 
statement that he had found a proof (with some difficulty, 
“non sans pene”: Fe.11.198). Writing to Ehler, Euler gives a 
somewhat clumsy proof based on the application of the 
binomial formula to (1+ 1)’~' and a suitable re-arrangement 
of the terms; he reproduces this in 1736 but adds to it the 
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additive proof for Fermat’s theorem based on the binomial 
formula for (a+ 1)’ (Ew.I-2.33-37 = E 54/1736; cf. Chap.II, 
8IV). It is not until 1752, however, and perhaps not even 
then, that he appears clearly aware of Fermat’s priority in 
this matter (Eu.II-5.141 in E 182). Obviously Euler, in his 
younger days, had been far too preoccupied with his own 
thoughts to pay close attention to the legacy of his 
predecessors. 


§V. 


Sometimes the text of a play is preceded by a short sketch 
of the characters “in the order of their appearance”. Similarly, 
before embarking upon a detailed description of Euler’s 
arithmetical work, and at the cost of some repetition, we 
shall list here its main themes in chronological perspective, 
in the order in which they appear in his writings and his 
correspondence. Some of these might have been classified 
by him as analysis rather than number theory; nevertheless 
he was well aware of the connections between those fields. 
As he once observed while proving an arithmetical theorem 
by the method of formal power-series: “from this, one may 
see how closely and wonderfully infinitesimal analysis is 
related not only to ordinary [1.e. algebraic] analysis but even 
to the theory of numbers, however repugnant the latter 
may seem to that higher kind of calculus” [“ex hoc casu in- 
telligere licet, quam arcto et mirifico nexu Analysis infinitorum non 
solum cum Analyst vulgari, sed etiam cum doctrina numerorum, 
quae ab hoc sublimi calculi genere abhorrere videtur, sit contuncta”: 
Eu.1-2.376 in E 2431751; cf. infra, §XXI]. 


(a) Fermat’s theorem, the multiplicative group of integers modulo 
N, and the beginning of group theory (cf. 88VI,VII). 


This begins in 1730 with the investigation of Fermat’s 
conjecture about 2” + 1 (Corr.I.18|8 Jan.1730; cf. above, 
8IV) and the experimental discovery of Fermat’s theorem 
in 1731 or 1732; the additive proof for it is obtained in 
1735 (PkU.295). At about the same time Euler was re-dis- 
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covering Bachet’s solution of equations ax — by = m, or 
equivalently of congruences ax = m (mod 8), or of double 
congruences z = p (mod a), z = q(mod 6) (Eu.I-2.18-32 = 
E 36|1735), of course by the euclidean algorithm, in obvious 
ignorance of the fact that this had been known to Bachet, 
Fermat and Wallis, not to mention Aryabhata (cf. above, 
Chap.II, §XIII, and Chap.I, §IV). In our eyes its main feature 
is that it establishes the group property for congruence 
classes prime to an integer N modulo N by showing that, if 
a and 6 are prime to N, there is an x prime to N such that 
ax = b (mod N); indeed it is precisely thus that Euler came 
to use it in later years (cf. e.g. Eu.I-2.350 in E 242|1751, 
Eu.1-3.8 in E 283(1760, Eu.1-3.521 in E 5541772, etc.) 

For a more systematic investigation of the multiplicative 
group of integers modulo a prime p (the group mostly written 
nowadays as F,*) or more generally modulo an arbitrary 
integer N, one must then wait until Euler’s Berlin period, 
after he had found an incentive for this in his experimental 
results on quadratic forms mX® + nY*. From then on, Euler’s 
progress can be traced through a steady stream of publi- 
cations as well as in a posthumously published manuscript, 
the Tractatus, apparently composed around 1750 and then 
abandoned (Fu. I-5.182—283; cf. infra, V1). Here we mark 
merely some of the main stages of that progress: from 1747 
on, quadratic residues and m-th power residues; then the 
number of solutions of acongruencex”= | moduloa prime, 
at first (in 1749) by a difference argument; in the Tractatus, 
the multiplicative (i.e. group-theoretical) proof for Fermat’s 
theorem and its extension to “Euler’s theorem” a®™? = 1 
(mod N) for any a prime to N; finally, in 1772, the existence 
of ¢(p— 1) “primitive roots” modulo any prime p, with some 
of its main consequences. 


(b) Sums of squares and “elementary” quadratic forms (cf. 881X, 
XD. 


As we have seen (cf. §1V), Euler was early fascinated by 
Fermat’s assertions concerning sums of four squares, of 
three triangular numbers, etc. (Corr.1.24|25 June 1730). At 
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first, just like Fermat (cf. Chap.II, §V), he merely made the 
rather trivial remark that no integers of the form 4m + 3, 
or more generally n°(4m + 3), can be a sum of two squares, 
and no integer n°(8m+7) can be a sum of three squares 
(Corr.1.44|17 Oct. 1730); this is of course easily proved by 
writing congruences modulo 4 or 8. In 1741 he mentions 
having known “for a long time” that 4mn — m — | cannot 
be a square if m, n are positive integers (Corr.I.107), but it 
soon turns out (Corr.I.114) that in this he had relied upon 
Fermat’s statement that a’ + 6°, for a prime to 6, has no 
prime divisor of the form 4n — 1, from which indeed it 
follows easily. Only in 1742 is he able to send to Goldbach 
a proof for the latter fact (Corr.I.115), based of course on 
Fermat’s theorem a’~' = 1 (mod ). 

This was the first stage in a seven years’ campaign to 
prove all of Fermat’s statements about sums of two squares 
(cf. Corr.1.134]1742, 313|1745, 415-419]1747; Eu.J-2.295— 
327|20 March 1749), at the end of which Euler could trium- 
phantly report to Goldbach his final victory: “Now at last I 
have found a conclusive proof...” (“Nunmehr habe ich endlich 
einen biindigen Beweis gefunden .. .”: Corr.1.493—497|12 April 
1749; cf. Eu.1-2.328—337|1750). In the meanwhile, however, 
and from 1747 onwards, we find in Euler’s letters and in 
Goldbach’s replies more and more frequent references to 
sums of three and four squares, and, in connection with 
these, to orthogonal transformations in 3 and in 4 variables 
(cf. e.g. Corr.1.440|1747, 515-521]1750 and Eu.1V A-5.468, 
478—480|1770). The decisive discovery of the identity for 
sums of four squares is communicated to Goldbach in 1748 
(Corr.1.452; cf. Eu. 1-2.368—369 in E 242(1751 and Eu.1-6.312 
in E 407|1770). In the same letter of 1749 where Euler was 
sending to Goldbach the definitive proof for sums of two 
squares, he is also able to give a similar proof, based on his 
new identity, for the fact that every integer is at any rate a 
sum of four rational squares (Corr.1.495—497; cf. Eu.1-2.338- 
372 = E 242|1751). As to Fermat's original assertion about 
sums of four squares, it was left to Lagrange to take the 
final step in 1770 (Lag. {II.189—201); but soon Euler followed 
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this up with a beautiful memoir (£u.[-3.218-239 = 
E 445|1772) giving a new variant of his own proof for two 
squares and then showing how it applied not only to the 
forms X° + 2Y¥*, X° + 3Y* but also to sums of four squares. 
As to the “elementary” quadratic forms X* + AY? with 
A = +2, 3, whose theory, at least in its main outlines, had 
already been known to Fermat (cf. Chap.IJ, §XII), Euler 
had been studying them since 1752 (Corr.1.597|1752; cf. 
Eu.1-2.467—485 in E 2561753 and Corr.1.622|1755), finally 
achieving complete results for A = 3 in 1759 (Eu. 1-2.556—- 
575 = E 272\1759) and for A = +2 only in 1772 (Eu.I- 
3.274-275 in E 449|1772). 


(c) Diophantine equations of degree 2 (cf. §XIII). 


This topic, too, appears quite early in Euler’s correspon- 
dence with Goldbach (Corr.1. 30—31,36—37| 1730; cf. above, 
§IV), where in particular it is pointed out that infinitely 
many solutions of an equation y* = ax” + bx + cin integers 
can be derived from the knowledge (however obtained) of 
one solution, combined with the solution of “Pell’s equation” 
X? — aY*® = 1. Those results are described in E 29|1733 
(= Eu.J-2.6-17) and taken up again in fuller detail in 
F 279|1758 (= Eu.1-2.576—611; cf. Corr.1.629[1755) and in 
E 32311759 (= Eu.1-3.73-111), where the connection be- 
tween Pell’s equation X* — aY° = 1 and the continued frac- 
tion for Va turns up for the first time (cf. (f) infra). 

Equations y° = ax” + bx + c, to be solved either in fractions 
or in integers, re-appear in Euler’s Algebra, composed not 
later than 1767 (Vollsténdige Anleitung zur Algebra, 2 vol., St. 
Petersburg 1770: Eu.I-1.1-498 = E 387, 388) whose final 
section (Von der unbestimmten Analytic: Eu.1-1.326—498) was 
described by Lagrange, in a letter to d’Alembert, as “a treatise 
on diophantine equations, and in truth an excellent one” 
(“un traité sur les questions de Diophante, qua est a la vérité excellent”: 
Lag. X111.181/1770). Finally, in 1772, Euler extended his 
results to the most general equation of degree 2 in two 
unknowns, to be solved in integers (Eu.1-3.297-309 = 
E 452); curiously enough, in the introductory section of 
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that paper which deals with rational solutions, there is no 
reference to Lagrange’s important memoirs on that subject 
(Lag.11.377—535|1768, 655-726|1769; cf. above, Chap.I1, 
§XIV, and Chap.IV, 8III) which had been the subject of 
an exchange of letters between him and Euler in 1770 (Eu.TV 
A-5. 466—467,47 1—476,477; cf. above, §II). 


(d) Diophantine equations of genus 1, and others (cf. 88XIV, 
XV). 


Such questions, including the traditional methods of so- 
lution which go back to Diophantus (cf. Chap.I, §X, and 
Chap.II, §XV) may well have been familiar to Euler from 
his student days. In 1724 we find his friend Daniel Bernoulli 
instructing Goldbach in the use of those methods and men- 
tioning his own early interest in such matters’ (Corr. 1J.202— 
203|1724; cf. ibed.190|1723). At that time Daniel did not 
know yet that the iteration of the traditional method yields 
infinitely many solutions (“quod impossibile censeo”: 
Corr.II.203); in 1730 he had found this out (Corr.11.356). 
By that time one may imagine him and Euler discussing 
such problems together; in 1730 Euler mentions a fruitless 
attempt to extend to equations y* = P(x), with a polynomial 
P of degree 3, the idea which he had just applied successfully 
to the case when P is of degree 2 (Corr.1.37). 

Some years later Euler devotes a whole memoir (Fu. I- 
2.3858 = E 98)1738) toa detailed treatment of the equations 
xi t+ yi = 22, x4 + y* = 22? (cf. Daniel Bernoulli's letter to 
him, Corr.11.45 1/1738); he refers to Frenicle’s Traité des Tri- 
angles (cf. Chap.II, §X), reprinted in the Mémoires of the 
Paris Academy, vol. V (1729) and perhaps freshly arrived 
in Petersburg, but is clearly unacquainted with Fermat’s 


"In Corr.11.202, “sextum vel septimum” is surely a mistake, or misprint, 
for “dectmum sextum vel septimum”; cf. Corr.{1.190. Daniel could hardly 
have been solving difficult diophantine equations when he was “six or 
seven years old”. 
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own proof (Fe.1.340-341, Obs.XLV; cf. Chap.II, $X). He 
also adds the proof that no triangular number can be a 
fourth power (cf. above, SIV, and Chap. II, $X) and a treat- 
ment of the equatioas x* + | = y’. 

Eventually he became aware of Fermat’s assertion about 
the equation x” + y" = 2” (Corr.1.445—446|1748); “a very 
beautiful theorem”, he calls it in 1753 (Corr.1.618), adding 
that he can prove it for n = 3 and n = 4, but not otherwise 
(cf. Corr.1.623|1755). For n = 4 this was in fact included in 
his results of 1738. For n = 3 his proof must have been the 
one combining Fermat’s descent with the theory of the 
quadratic form X? + 3Y? (cf. infra, §XIV); he had already 
made essential use of the “elementary” quadratic form 
X*® — 2Y? in dealing with a diophantine problem originating 
from Fermat (Eu.1-2.223-240 = E 167|1748). 

Also during the fifties, Euler studied Diophantus carefully, 
hoping to bring some order into what had all the appearances 
of a haphazard collection (£u.I-2.399-458 = E 253, 
255|1753—54). He did succeed in isolating a few of the Greek 
geometer’s favorite tricks, but otherwise the result of his 
efforts was disappointing; only the later developments of 
algebraic geometry were to throw light on a subject which 
was not yet ready for even partial clarification. Perhaps this 
is why Euler’s interest in it seems to have flagged during 
the latter part of his stay in Berlin. He came back to it only 
by composing the last section of his Algebra, an excellent 
treatise indeed, as Lagrange had called it, but containing 
little new material if any. 

From that time on, diophantine problems seem to have 
become one of Euler’s chief amusements. Noteworthy is a 
series of papers, all written in 1780, and all published only 
in 1830 (Eu.1-5.82—-115,146-181 = E772,773,777,778), in 
which diophantine equations P(x, y) = 0, where ® is a poly- 
nomial of degree 2 in x and of degree 2 in y, are considered 
for the first time; it is at this point that the relationship, in 
Euler’s work, between that subject and elliptic integrals ap- 
pears most clearly (cf. §§XV,XVI). 
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(e) Elhptic integrals (cf. $XV). 


As Euler knew, Leibniz and Joh. Bernoulli had already 
asked whether the differential 


dx 


a ed 
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can be integrated by means of logarithms or inverse trig- 
onometric functions and had guessed that it could not. When 
Euler asked the same question in 1730 (Corr.1.47,51) and 
then, in 1738, gave a proof for Fermat’s theorem about the 
diophantine equation X* — Y* = Z?, it must surely have 
occurred to him that any substitution transforming @ into 
a rational differential might well supply rational solutions 
for 2? = 1 — x‘, and hence integral solutions for Fermat’s 
equation. To his mind this gave perhaps some added flavor 
to integrals such as fw and the more general ones arising 
from the rectification of the ellipse, hyperbola and lemniscate 
(cf. e.g., Eu.I-20.8-20 = E 52/1735 and Eu.1-20.21-55 = 
E 154| 1749); he was also meeting such integrals, in connection 
with problems in applied mathematics, in his investigations 
on the so-called “elastic curves” (cf. his letter to Joh. Bernoulli 
of 20 Dec.1738, Brbl.Math.(11]) 5 (1904), p. 291, and the 
appendix De curvis elastics to his treatise of 1744 on the 
calculus of variations, Eu.I-24.231—297 in E 65). 

It is only in 1751, however, that FAGNANO’s Produzioni 
Matematiche, reaching him in Berlin, opened his eyes to the 
fruitful field of investigation that lay unexplored there. Al- 
ready a month after receiving those volumes, he was reading 
to the Berlin Academy the first memoir (Eu. 1-20.80-107 = 
E 252/1752) of a long series (from Eu.1-20.58-79 = E 251 
to Eu.1-21.207-226 = E 676|1777), at first only on the in- 
tegrals considered by Fagnano, but leading up to the proof 
of the addition and multiplication theorems for integrals 
of the form 


F(x)dx 


VP(x) 
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where P is a polynomial of degree 4 and F an arbitrary 
polynomial or even a rational function. At the same time 
he was obtaining the general integral of the differential 
equation 

dx dy 


= ——— 
V P(x) VP(y) 

in the “canonical form” P(x, y) = 0, with ® of degree 2 in 

x and of degree 2 in y. As noted above in (d), this was later 

transported to the theory of diophantine equations. 


(f) Continued fractions, Pell’s equation and recurrent sequences 
(cf. $XI1). 


In Euler’s letters to Goldbach, continued fractions make 
their first appearance in connection with Riccati’s differential 
equation (Corr.1.58—59|1731, 63|1732), but soon also in an 
arithmetical context (Corr.1.68|1732); already at that time 
Euler was noticing the relation between periodic continued 
fractions, “recurrent” sequences (i.e. sequences (p,) such 
that p, = ap,-1 + bp,-» for all n > 2; cf. Corr.1.30/1730) 
and quadratic irrationals (cf. Corr.1.36|1730). Of course 
his use of the euclidean algorithm for solving equations 
ax — by = +1 (Eu.1-2.20-21 in E 36]1735; cf. (a) above) 
also amounts to constructing the continued fraction for a/b 
(cf. Corr.1.243]1743, 299-301|1744), as may well have been 
obvious to him from the beginning and as he was to spell 
out in 1737 (Eu.1-14.194—196 in E 71) and again in Chapter 
XVIII (De fractionbus continus, Eu.1-8.362—390) of his great 
Introductio in Analysin Infinitorum of 1748. 

For many years he pursued this theme in its various aspects 
(cf. Eu.I-14.187-215 = E 71|1737, Eu.J-14.291-349 = 
E 123]1739, and Eu.I-15.31-49 = E 281}1757, supple- 
menting the above-mentioned chapter XVIII of the In- 
troductio); but the role played by the continued fraction al- 
gorithm in the solution of Pell’s equation was only clarified 
by him in a memoir of 1759, first published in Petersburg 
in 1767 (Eu.1-3.73—-111 = E 323) which may well have given 
Lagrange the final clue for his definitive treatment of that 
problem (cf. Chap.IV, $1). 
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(g) Summation of ((2v) and related series (cf. §8XVIH,XVIII, 


> 


One of Euler’s most sensational early discoveries, perhaps 
the one which established his growing reputation most firmly, 
was his summation of the series 2{n~? and more generally 
of Xjn~*, ie. in modern notation {(2v), for all positive even 
integers 2v (Eu.I-14.73-86 = E 41|1735). This was a famous 
problem, first formulated by P. MENGOLI in 1650 (cf. G. 
Enestrém, Bibl.Math. (III) 12 (1912), p. 138); it had resisted 
the efforts of all earlier analysts, including Leibniz and the 
Bernoullis (cf. e.g. Leibniz’s letter to Joh. Bernoulli, 
Math.Schr.1-3.454|1697, and Joh. Bernoulli’s letter to Euler, 
Corr.11.15|1737). Characteristically, before solving it, Euler 
had engaged in extensive numerical calculations in order 
to get good approximate values for these sums; it is largely 
with this intention, it seems, that he had developed the 
method traditionally known as “the Euler-MacLaurin sum- 
mation formula”, and in so doing had re-discovered the 
“Bernoulli numbers”, whose true importance for number 
theory was not to emerge until the next century (Eu.I-14.42— 
72 = E 25|1732, Eu.1-14.108-123 = E 47|1735; cf. Part 
V, Chap.II, of the Institutiones Calculi Differentialis of 1755, 
Eu.1-10.309-336). 

What Euler found in 1735 is that {(2) = 7?/6, and more 
generally, for v >= 1, £(2v) = r,7*”, where the 7, are rational 
numbers which eventually turned out to be closely related 
to the Bernoulli numbers. At first Euler obtained the value 
of ¢(2), and at least the next few values of {(2v), by a somewhat 
reckless application of Newton’s algebraic results, on the 
sums of powers of the roots for an equation of finite degree, 
to transcendental equations of the type 1 — sin (x/a) = 0. 
With this procedure he was treading on thin ice, and of 
course he knew it. 

Publication in Petersburg was slow; but soon Euler had 
sent the news to friends and colleagues all over Europe, 
and it became the topic for lively discussions between him 
and some of his correspondents. In the meanwhile he was 
sparing no efforts to consolidate his methods. Less than ten 
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years after his first discovery, he was able to include in his 
Introductio in Analysin Infinitorum (published in Lausanne in 
1748, but ready in manuscript in 1744; cf. Corr.1.292)1744) 
a full account of the matter, entirely satisfactory by his stan- 
dards, and even, in substance, by our more demanding 
ones. This was based on a detailed treatment of trigonometric 
functions and of their expansions into infinite series and 
infinite products (Eu.1-8.133-212 = Chap. VIII-XI of the 
Introductio), to which he added a whole chapter on the “eu- 
lerian” products for ¢(s) and various L-series (ibid.284—312 
= Chap.XV). As to his efforts to gain information about 
the numbers ¢(n) for odd n > 1, it is no wonder that they 
remained unsuccessful, since hardly any progress has been 
made with this problem down to the present day. Never- 
theless they were not unfruitful; it was in fact while searching 
for such information that Euler hit upon the functional 
equation for {(s) and for the companion series 


x 


_ (- 1)" 
L(s) = 2 (2n + 1) 


(Eu.I-15.70-90 = E 352|1749). These remained as mere 
curiosities, and virtually unknown, until they received new 
life at the hands of Riemann in 1859. 


(h) “Partitio numerorum” and formal power-series (cf. §X XI). 


In 1740 a Berlin mathematician of French origin, Ph. 
NAUDE, wrote to Euler, asking (among other questions) 
in how many ways a given integer m can be written as a 
sum of yw distinct integers. Euler answered almost imme- 
diately (PkU.193) and followed this up within a few months 
with a memoir presented to the Petersburg Academy only 
shortly before taking his leave (Eu.1-2.163-193 = 
E 158|6 April 1741). He was to come back to this topic on 
several occasions, treating it in some detail in a chapter of 
his Introductio of 1748 (Chap.XVI = Eu.I-8.313-338) and 
for the last time in 1768 (Eu.I-3.131-147 = E 394). 

He had seen at once that the key to Naudeé’s question lay 
in the consideration of suitable formal power-series such 
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as 7 \(1 + xz) and II. ,(1 — x’z)~'. In dealing with them 
he soon came across the remarkable infinite product 


T]d -x)=1l-x-x 4x? tx? — +x?! + ete. 

7=1 
(Eu.1-2.191 in E 158!1741), which, expanded into a power- 
series as shown by the above formula, could not fail to strike 
him; not only are all the non-zero coefficients equal to + 1, 
but the exponents are easily recognizable as “pentagonal 
numbers” 3n(3n + 1). It took Euler almost ten years before 
he was able to send a proof for this to Goldbach (Corr. 1.522— 
524|1750). 

As Jacobi found out, a full explanation of such matters 
has to be sought in the theory of theta-functions and of 
modular forms. This lay far beyond Euler’s ken. Nevertheless 
one cannot but admire his power of divination when he 
tells Goldbach that the most natural method for proving 
Fermat’s theorem on sums of four squares would be by 
considering the fourth power of the series 27_, x" (not quite 
the theta-series 3"_.. x’”, but close to it). This, too, had to 
wait for Jacobi. 


(2) Prime divisors of quadratic forms (cf. 88VILLIX). 


The same letter to Goldbach, quoted in §III, and written 
by Euler almost immediately upon his arrival in Berlin 
(Corr.1.105—107|9 Sept. 1741), contains some momentous 
mathematical observations: firstly, as mentioned above in 
(b), the result about 4mn — m — 1, 1.e. in substance about 
the divisors of the quadratic form X* + Y*; then a number 
of statements, obtained “by induction” (i.e. experimentally) 
on the prime divisors of the forms X* —- 2¥Y”, 
xX? — 3Y°, X® — 5Y*; and finally his first results about the 


integrals 
{ dx 
6 é _ x" yr"4 


(later known as “eulerian integrals of the first kind”). 
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As he wrote to Clairaut a few months later, in response 
to a friendly word of enquiry about his occupations in Berlin, 
he was “enjoying perfect rest” there [Je jouis ... @un parfait 
repos”: EuAV A-5.110|fan.—Feb. 1742] while waiting for the 
king to pay some attention to his academy; obviously he 
turned this period of leisure into one of intense creativity, 
particularly in the field of number theory. Within a year 
he had extended his observations to a large assortment of 
quadratic forms X? + AY*, A being a positive or negative 
integer (Corr.1.146—150|28 Aug.1742, 161—162|27 Oct. 1742; 
cf. ibid.260|15 Oct.1743). Already in August 1742 he was 
quite close to the law of quadratic reciprocity and empha- 
sizing the value of his discoveries: “I feel sure”, he wrote, 
“that I am far from having exhausted this topic, but that 
innumerably many splendid properties of numbers remain 
still to be discovered” [“Tch glaube aber fest, dass ich diese Materie 
bet weitem noch nicht erschépfet habe, sondern, dass sich darin 
noch unzihlig viele herrliche proprietates numerorum entdecken 
lassen. ..”: Corr.1.150). He rightly attached such importance 
to his conjectures that he never tired of repeating them to 
the end of his days, in more or less precise form, while 
making little progress towards proving them (cf. however 
Eu.1-3.280—281 in E 449|1772, and Eu.I-4.163-196 = 
E 598|1775, the latter memoir being based on Lagrange’s 
Recherches d’Arithmétique of 1775). A clear formulation of 
the law of quadratic reciprocity appears in the conclusion 
(Eu.1-3.512 in E 552|1772) of a paper written in 1772 and 
published in Euler’s Opuscuda Analytica in 1783, the year of 
his death. 


(j) Large primes (cf. $X). 


Already in 1730, when taking up the Fermat numbers 
2°" + 1, Euler had met with the problem of deciding whether 
a given large number is a prime. As Fermat had known (cf. 
Chap.I, §1X), a non-square integer of the form 4n + 1 is 
a prime if and only if it has one and only one representation 
as a sum of two squares and for this representation the two 
squares are mutually prime. The first mention of Euler’s 
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rediscovery of this criterion occurs in 1742 (Corr.1.134— 
135). In 1749, just before completing the proofs about sums 
of two squares (cf. (b) above), Euler applied this criterion 
to various primes and non-primes (E£u.1-2.321-327 in 
E 228). The same method is described more systematically 
in E 283]1760 (= Eu.I-3.1-45) and in E 369|1765 (= Eu.I- 
3,112—130); in the former paper Euler examines more spe- 
cifically the primes of the form a? + 1 and also gives his 
method (based on the euclidean algorithm; cf. (a) and (6) 
above) for solving the congruence x” = — 1 modulo a given 
prime p = 4n + 1. 

Finally, in his old age, Euler generalized Fermat’s criterion 
to a large number of quadratic forms mX* + nY?, com- 
municating this new method to his friend and former Berlin 
colleague, the Swiss mathematician Béguelin (Eu.1-3.418~ 
428 = E 498 and 708a|1778; cf. Eu.I-4.269-289,303— 
328,352-398 = E 708,715,718,719,725|1778). This was 
based on the concept of “numerus idoneus”, i.e. “suitable 
number”. According to the original, somewhat inexactly 
formulated definition, a number N 1s called “suitable” if 
(and only if) every odd integer which can be written in one 
and only one way as a> + NO’, with a prime to 3, is prime; 
it is asserted that all the forms mX* + nY* with mn = N 
have then the same property. Eventually this is modified 
by specifying that certain trivial cases have to be excluded. 
Euler also found at least a heuristic method for deciding 
whether an integer N is “suitable” in his sense (i.e., suitable 
for testing the primality of large numbers by a criterion 
generalizing Fermat’s); he listed 65 such integers, from | 
to 1848, and used them to discover some large primes (larger 
then 10’) which had not been known before. 

In his Disquisitiones (Art.303), Gauss clarified the matter 
somewhat by indicating that Euler’s numeri idonet are 
precisely those determinants of binary quadratic forms for 
which there is only one class in each genus. Euler had 
already encountered questions pertinent to the theory of 
genera when he had sought to investigate which integers 
can be written as mx* + ny® with rational x, y, when m and 
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nm are given integers (Eu.I-4.1-24 = E 556|1772; cf. 
Corr.1.605|1753). 


§VI. 


We begin with Fermat’s theorem (cf. above, §IV, and 
Chap.II, §1V). Fermat had discovered it while trying to 
factor such numbers as 2” + 1, 3” + 1, etc. As we have 
seen, it can be formulated and proved either additively, the 
proof being then based on the binomial theorem, or mul- 
tiplicatively. Fermat’s final formulation for it (Fe.11.209) had 
been of the latter type, and it may be surmised that he had 
proved it accordingly, even though he had himself discovered 
enough about binomial coefficients (cf. Chap.II, §11) to be 
capable of the other proof. 

Euler’s interest in this matter arose from Goldbach’s in- 
sistence upon Fermat’s conjecture about the numbers 
2°" + 1 (cf. above, §IV). Treading unwittingly in Fermat’s 
footsteps, he began by discovering the theorem experi- 
mentally, at first in the case of 2?~' — 1, then in general 
(cf. §1V). In contrast with Fermat, however, he had ab- 
sorbed the binomial theorem almost with his mother’s 
milk; it is not surprising, therefore, that the first proof he 
obtained was the additive one (Eu.1-2.33-37 = E54 | 1736; 
cf. PkU.295)1735). 

Once in Berlin, when he took up in right earnest the 
investigation of sums of two squares (cf. infra, §1X), Fermat’s 
theorem became one main ingredient in the proof of his 
first results (Corr.1.115—116|1742; cf. Chap.II, §VII, and 
infra, §1X); but this may not yet have provided an incentive 
for him to seek the proof of Fermat’s theorem in the mul- 
tiplicative properties of integers modulo a prime. One is 
tempted to look for such an incentive in the “herrliche pro- 
prietates” discovered empirically by him in that same year 
1742, properties which he described to Goldbach with even 
more than his wonted enthusiasm (Corr.1.146—150/28 
Aug.1742); there, having investigated quadratic forms 
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X? + NY? and examined, for various values of N, the dis- 
tribution modulo 4N of the prime divisors of that form (1e., 
in modern parlance, the rational primes which split in the 
field Q(V —N), this is what he found (Corr.I.148): “If a 
prime of the form 4Nn + s is a divisor of X®? + NY?®, then 
so is every prime of the form 4Nn + s*; if two primes 
4Nn + s,4Nn + t are such, then so is every prime of the 
form 4Nn + s‘t'”. In fact (cf. infra, §VIID he was discovering, 
not only what is in substance the law of quadratic reciprocity, 
but at the same time the concept of a subgroup of the mul- 
tiplicative group modulo 4N, and more precisely of a 
subgroup of index 2 of that group. 

Whether or not these observations gave Euler the primary 
motive for exploring group-theoretical concepts further, 
he obviously gave much thought to them, from 1745 to the 
end of his life. Concurrently he became aware of Fermat’s 
authorship of his theorem (cf. above, §IV); perhaps he was 
also impressed by Fermat’s distinctly multiplicative for- 
mulation. However that may be, it is fascinating to watch 
his progress towards a more and more conceptual approach 
to number theory (belying the legend that he was an em- 
piricist not much given to abstract thinking), a progress 
evidenced by a vocabulary which became more and more 

“modern” as years went by. 
_ Already in 1747, while still sticking to the additive proof 
for Fermat’s theorem (£u.I-2.65—-67 in E 134), he finds it 
worthwhile to point out that, in matters pertaining to di- 
visibility by p (where p need not be a prime), integers a, 6, 
etc., may be replaced by a + ap, b + Bp, etc., i.e. by any 
integers leaving the same residues in their division by p 
(thid., p.77, Scholion); he discusses “residues arising from 
the division of squares” (or of cubes, etc.) and begins to 
consider the properties of such residues which arise from 
Fermat’s theorem. In particular he finds (:did., p.81, th.13) 
that “if a — f” is divisible by the prime p = mn + 1, then 
so is a” — |” (an easy consequence of Fermat’s theorem), 
and at once proposes the question of proving the converse, 
which he has found invariably true (zbzd., p.82, Scholion). 
Soon after that, in his first major paper on sums of two 
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squares (Eu.1-2.295-327 = E 228/1749), we find a more 
careful discussion of quadratic residues, still under the 
somewhat cumbersome designation as “the residues arising 
from the division of squares by the prime p” (zbid., pp.3 12— 
313, Scholion); it is noted there that the number of such 
residues is exactly 3(p— 1), and it is conjectured that, for 
p = 4n + 1, they are precisely the solutions of x*"= 1 (mod p), 
this being a special case of the conjecture described above 
(with m, n replaced by 2n, 2); it is the decisive case for the 
theory of sums of two squares. 

Stull in 1749, further progress is made (Corr.1.493—495, 
cf. Eu.1-2.328-337 = E 241|1750) by applying the difference 
operator to the congruence x*" = | (mod p) in order to show 
that it cannot be satisfied for all x prime to the prime p = 
4n + 1. As generalized by Euler a few years later (Eu.I- 
2.516 in E 262|1755), the argument is as follows. Call D the 
difference operator 


(Df\(x) = fle+ 1) — fx). 


By induction on m, one finds, for any m 2 |: 


(D"f\(x) = fix+m) — (7 \fecrm- 1) 
+ (2)ferm—2) +(x). 


Therefore, if a prime p divides f(x) for x = 1, 2, ...,m + 1, 
it must divide (D"f)(1). Also by induction on m, one finds 
that, if f(x) = x", D"fis m! for w = mand 0 for uw < m; this 
gives D"f = m! for every monic polynomial 


fix) = x" + ax) +e + ay, 


Consequently, if m < p — 1, the congruence f(x) = 0 (mod p) 
cannot be satisfied for x = 1,2, ..., m+1, nor still less for 
all x prime to p. 

As will be seen presently, this ingenious argument was 
to be superseded later by a more powerful and more struc- 
tural one. When it first appears in the above form, it is used 
for settling the question left open in 1747, by showing that, 
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if p = mn + Lisa prime and a” — 1 is divisible by p, then 
a is an n-th power residue modulo p (loc.cit.pp.515-516, 
th.19). In fact, assume that a" = 1 (mod p) and write the 
identity: 


xm _ a” _ (x” — a) (x PP $a 2B ee peQ™), 


Fermat’s theorem shows that the left-hand side is a multiple 
of p for all x prime to p, while Euler’s argument shows that 
the second factor in the right-hand side cannot have that 
property; therefore one must have x"= a (mod ) for some 
x. Puta = —] and n = 2, and the proof reduces to the 
one reproduced above in Chap.II, §VII, which Euler had 
sent to Goldbach in 1749 (Corr.1.494—-495; cf. above, §V48, 
and infra, §1X); this had been the missing link in his earlier 
attempts to prove Fermat’s theorems on sums of two squares. 

Greatly encouraged, no doubt, by this success, and having 
seen through the press his /ntroductio in Analysin Infinitorum 
in 1748, Euler then’ set to work on a treatise, perhaps in- 
tended as a similar “introduction” into number theory, for 
which he wrote sixteen chapters before giving up the attempt; 
these were published in 1849 under the title Tractatus de 
numerorum doctrina (Eu.1-5.182—283 = E 792); in part this 
almost looks like a first draft for sections I, Il and II of 
Gauss’s Disquisitiones. It begins with a lucid exposition of 
some elementary matters, chiefly (Chap.2—4) the calculation 
of the functions now usually denoted by a(n), a(n), o(n), 
ie. respectively the number of divisors of n, their sum (fn 
in Euler’s notation), and the number of integers prime to 
n and less than n (the “Euler function”, for which Euler 
later used the notation mn; cf. e.g. Eu.J-4.105-115 = 
E 564|1775). Then it proceeds (Chap.5, nos.140—166) to 


? Not earlier than 1749, since the use of the difference operator, dis- 
covered in that year, figures prominently there (£u.1-5.222, nos.256— 
259). It should be observed that the marginal notes added to the first 
draft of the treatise, and inserted as Additamenta in Eu.{-5, are after- 
thoughts, presumably of later date; e.g. the Additamentum to no.565, 
p.278, rejects a wrong proof in the text. 
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an elementary discussion of what since Gauss has been known 
as congruences with respect to a modulus; Euler’s word for 
the modulus, which he uses systematically in that sense, is 
“the divisor”. For a given divisor d, all integers r + dx are 
said to belong to the same “class” (no.14) and are to be 
regarded as “equivalent” (no.154). Any representative for 
the class of an integer a is called “a residue” of a, this term 
being sometimes (but not always) confined to the residue r 
of the division of a by d defined by a = r + dx,O<r<d. 
It is also specified that the mapping from the integers to 
their “residues” (in the sense of the equivalence of those 
residues) has the properties characterizing what we now 
call a ring-homomorphism. 

The next chapter (Chap.6) gives a treatment of the residues 
which arise, for a given “divisor” d, from the integers in an 
arithmetic progression a,a + b,a + 26, etc., or, in modern 
language, of the cosets of the subgroup generated by 6 in 
the additive group modulo d; of course this amounts to the 
determination of the g.c.d. of b and d, and to the solution 
of congruences bx = m (mod d), which had been the subject 
of one of Euler’s earliest papers® (Eu.I-2.18-32 = E 36|1735; 
cf. above, §Va). This is followed by the corresponding in- 
vestigation for the multiplicative group modulo d (Chap.7); 
it is shown that if 6 is prime to d, the residues arising from 
the progression 1, b, 6”, etc. make up a set closed under 
multiplication and division modulo d, i.e. in effect a group; 
from this, by considering its cosets in the multiplicative group 
of integers prime to d modulo d, it is shown that its order, 
i.e. the smallest integer vy > 0 such that d divides 6” — 1, 
must divide the order w = ¢(d) of the latter group. Therefore 
d must divide 6* — 1. 

That is Euler’s famous theorem; when d is a prime, it 
reduces to Fermat’s (cf. Chap.I], §1V). In later years, when 


* Oddly enough, no justification is given in the text of Chapters 1 to 
5 for the unique decomposition of integers into primes. The euclidean 
algorithm is mentioned there only as an afterthought, in the Additamenta 
to nos. 139 and 150. 
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publishing this proof (Eu.I-2.493-518 = E 262]1775 for 
the case of a prime “divisor”; Eu.I-2.531-555 = E271|1758 
in the general case), Euler not only insisted that this was a 
substantial generalization, justifying the introduction of the 
“Euler function” g(n) “which otherwise might seem sterile” 
(Eu.1-2.555, Scholion), but also pointed out that, even for 
d prime, the above proof is better than the “additive” one; 
it is “more natural”, he says, since the latter depends upon 
the binomial formula which seems alien to the subject (“a 
proposito non mediocriter abhorrere videtur”: Eu.1-2.510, Scho- 
lion). Incidentally, he also notes that g(N) is not in general 
the smallest exponent 7 such that N divides x" — 1 for all 
x prime to N; e.g. if N = p%q’r’, where p, q, r are distinct 
primes, that property belongs, not to the product of 
Pp '(p-1), FB '(q- 1), P '(r- 1), which is g(N), but to their 
Lem (Eu.1-2.532, Summarium); he had already made a 
similar observation in his very first paper on number theory 
(Eu.1-2.4 in E 26]1732) without being able to prove it. 

Of course congruences are most interesting when the 
modulus, or in Euler’s language the divisor, is an odd prime 
p, and we soon find him, in the Tractatus, concentrating on 
that case (Chap.8). He begins by showing that, if a is prime 
to p and p divides a” — 1, then, since p also divides a’~' — 1, 
it divides at — 1 where d is the g.c.d. of n and p — 1; in 
effect, he uses here the fact which would now be expressed 
by saying that the group of residues of powers of a modulo 
pis a homomorphic image of the additive group of integers 
modulo p — |. From this he deduces that, if » is itself an 
odd prime and p divides a" — | but nota — 1, then p must 
be of the form 2nx + 1; the case a = 2 is the relevant one 
for the investigation of perfect numbers. 

Stull following Fermat, consciously or not (cf. Chap.I], 
SIV), Euler observes next (no.255) that, for p = 2m + 1 
and a prime to p, p divides 


ae” _ 1 _ (a"— 1a" + 1) 


and must therefore divide either a” — 1 or a” + 1 but not 
both; and he asks whether it could divide a” — J for alla 
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prime to p. At this juncture he introduces his newly dis- 
covered method of finite differences to show that this can 
not happen; more generally, he uses it to show that, if 
n<p — 1,p cannot divide a” — 1 for all a prime to p. 

Here he narrowly missed an opportunity for showing the 
existence of “primitive roots” modulo an arbitrary prime, 
ie. of an element generating the multiplicative group modulo 
p, with all the consequences which he was later to deduce 
from this fact. Indeed, with the means already at his disposal, 
it can be shown easily that, if a and 6 are elements of that 
group (or in general of any finite commutative group), there 
is always an element a”b” whose order is the l.c.m of those 
of a and of b. Thus, if 7 is the greatest of the orders of the 
elements of that group, it must also be a multiple of the 
orders of all its elements, so that p must divide a” — 1 for 
all a prime to p; thus n cannot be <p — 1. 

Partly for lack of this result, most of the remainder of 
the Tractatus is tentative, obscure or just plain wrong. Its 
aim was clearly to give, in what would now be called group- 
theoretic terms, an account, firstly of quadratic residues, 
and then of m-th power residues modulo a prime f, at least 
for m = 3, 4, 5. Euler saw that, for each m, such residues 
make up a set closed with respect to multiplication and 
division: “a truly remarkable property”, he calls it (“imsegnem 
proprietatem”: no.295), having already, as we have seen, en- 
countered its analogue in connection with quadratic forms 
(cf. infra, §VII1). In our language, this is to say that the 
m-th power residues make up a subgroup G,, of the mul- 
tiplicative group G modulo p; Euler even discusses, in terms 
of the cosets of G,, in G, what is now called the factor-group 
G/G,,, but, except for m = 2 when G,, is of index 2 in G, he 
fails to determine its structure or even its order; his attempt 
to show, for m = 3, 4 and 5, that, if m divides p — 1, G/G,, 
is cyclic of order m does not even make sense. Clearly his 
project was premature and had to be given up. 

As usually happened with Euler, however, nothing was 
lost; whatever was most valuable in the Tractatus was to re- 
appear in the publications of the next decade. The greater 


196 Number Theory Chap.HI, §VI 


part (pp.339—365 in Eu.I-2.338-372 = E 242) of his paper 
of 1751 on sums of four squares is devoted to a careful 
exposition of the elementary properties of quadratic residues 
and non-residues, at first without assuming that “the divisor” 
(i.e. the modulus) is a prime. When it is an odd prime p, 
Euler shows that the residues make up a subgroup of index 
2 of the multiplicative group modulo f, with all that this 
implies, which to us seems so elementary as to appear trivial 
but in his days was new and needed spelling out. In particular 
he can now point out that, if —1 is a quadratic residue 
modulo p = 2n + 1, then quadratic residues must occur 
in pairs {r, —7}, so that n must be even; as to the converse 
he merely notes that he has looked in vain for a “direct” 
proof (i.e., apparently, one not depending upon the dif- 
ference operator: p.365, Scholion). 

His ostensible purpose in that paper had been to deal 
with sums of four squares (cf. infra, §X1). Then, for a few 
years, the Tractatus seems to have been set aside; the next 
paper to be based on it, significantly entitled Theoremata 
circa residua ex divisione postestatum relicta (i.e. theorems on 
m-th power residues modulo a prime: Eu.1-2.493-518 = 
E 262) was read to the Berlin academy in 1755; it describes 
Euler’s latest version (the one reproduced above) of his 
method of finite differences, with his criterion a” = 1 (mod p) 
for an integer a to be an m-th power residue modulo a prime 
p = mn + 1. Next in line comes the paper entitled Theoremata 
arithmetica nova methodo demonstrata (Eu.1-2.531-555 = 
E 271(1758), based this time on Chapters 4 to 7 of the Trac- 
tatus; the “new method” is here the group-theoretic inves- 
tigation of the additive and multiplicative groups modulo 
an arbitrary integer N, with the calculation of the order 
¢(N) of the latter group, all this leading up to Euler’s theorem 
that N divides a*) — 1 for all a prime to N. 

Perhaps nothing more could then have been extracted 
from the Tractatus, apart from some conjectures related to 
the quadratic reciprocity law and a few bold inductions about 
cubic and biquadratic residues, a proof for which lay well 
beyond Euler’s powers then and later. For some time his 
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interest in such matters seems to have slackened; to revive 
it, some outside impulse was needed. Lagrange was the one 
to supply it. 

For many years Goldbach had been alone in taking an 
interest (an amateurish one, but real nonetheless) in Euler’s 
arithmetical work. Their correspondence was almost com- 
pletely interrupted between 1756 and 1762 by the Seven 
Years’ war; in the meanwhile Goldbach was getting old; he 
died in 1764. Lagrange’s correspondence with Euler begins 
in 1754 (cf. Chap.IV, $1) when he was only 18 years old; 
his first letter seems to betray a raw beginner, scarcely worthy 
of Euler’s attention; characteristically Euler preserved it, 
although he did not answer it. Lagrange’s next letter, how- 
ever, written a year later, and outlining an altogether new 
and original treatment of variational problems, made a deep 
and lasting impression on Euler, who from then on, in spite 
of the difference in age, treated Lagrange as an equal. La- 
grange had been born in Turin in 1736; he lived there, 
except for a trip to Paris in 1763-1764, until 1766, when 
he moved to Berlin as Euler’s successor. In 1787 he finally 
settled down in Paris; by that time he had no active interest 
in number theory any more. He died in 1813. 

Much if not all of Lagrange’s work is based on that of 
Euler; this applies in particular to his contributions to number 
theory, about which he once wrote to Euler: 

“Je suis trés charmé que mes recherches sur les problemes inde- 
terminés aient pu meriter votre attention, le suffrage d'un savant 
de votre rang est extrémement flatteur pour mot, surtout dans une 
matiere, dont vous étes le seul juge compétent que je connorsse. Il 
me semble qu’il n’y a encore que Fermat et vous qui se sovent occupés 
avec succés de ces sortes de recherches, et st jar été assés heureux 
pour ajouter quelque chose a vos decouvertes je ne le dois qu’a 
létude que jai faite de vos excellens ouvrages.” (“Il am most 
happy that my work on indeterminate problems should 
have been found worthy of your attention; the favorable 
opinion of a scientist of your rank is most flattering for me, 
particularly in a field where, to my knowledge, you are the 
one and only competent judge. It seems to me that only 
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Fermat and you have been successful so far in dealing with 
such matters, and, if I have been so lucky as to add to your 
discoveries, I owe it to nothing else than the study I have 
made of your excellent works”: Eu.LV A-5.471[12 Feb.1770]. 

This was no mere politeness or flattery; in his corre- 
spondence with d’Alembert and others, Lagrange expressed 
himself about Euler more freely and more critically, no 
doubt, but always with the same sense of deep indebtedness 
and admiration. 

This quotation, and the one in §II above, are part of an 
exchange of letters between Euler and Lagrange concerning 
the latter’s great memoir on indeterminate equations 
(Lag.11.377-535), published in the Nouveaux Mémoires of 
Berlin in 1769; they had reached Euler in 1769 or early in 
1770. Partly, no doubt, because of this exchange, Lagrange 
composed in 1770 a continuation of that memoir (Lag.11.655— 
726) which Euler received in 1770 or early in 1771 (cf. 
Eu.1V A-5.488]20 May 1771); it contained a proof for the 
fact that a congruence of degree n modulo a prime p has 
at most 7 solutions modulo p (loc.cit.pp.667—669). 

This was precisely the point which Euler had missed when 
writing the Tractatus; one may surmise that he was quick to 
realize it and revise his thinking accordingly about the mul- 
tiplicative group modulo a prime. At any rate, scarcely more 
than a year later, we find him, on 18 May 1772, presenting 
three papers’ on m-th power residues to the Petersburg 
academy (E£u.1-3.240-281 = E449,497-512 = E552,513- 
543 = E 554). 

Strangely enough, Lagrange’s proof (which, incidentally, 
is made explicit only for congruences of degree 3, but is 
rightly stated to be general) had been based on a modification 
of Euler’s argument by finite differences. It may well have 


4 Only the first one appeared in the Novi Commentart of the Academy, 
in 1774. The other two were included by Euler in his Opuscula Analytica 
of 1783; perhaps he had waited with them, hoping to have more to say 
on the matter, but then, feeling his time to be running out, he wished 
to see them printed quickly. 
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been inserted by Lagrange into his paper as an afterthought; 
at any rate no use is made of it there, which is perhaps why 
Euler did not quote Lagrange in this connection. Euler’s 
proof for it is the modern one. Nowadays one expresses it 
concisely by saying that the integers modulo a prime make 
up a field (as was well known to Euler, in substance if not 
in words) and that the classical argument, already known 
to Descartes (loc.cit. Chap.I], §VII), for proving that an 
equation f(x) = 0 of degree n has at most n roots is valid 
over any field; in fact, if a is a root, one can divide f(x) by 
x — aand write 


f(x) = (& — a)fi(x); 


this cannot be 0 unless x = a or f\(x) = 0; as f; is of degree 
n — I, induction on n proves the assertion. In order to 
express the same idea, Euler observes (Eu.I-3.249) that, if 
a is a solution of the congruence f(x) = 0 (mod p), then one 
can write f(a) = mp, where m is an integer, so that a is a 
root of f(x) — mp in the usual sense. Writing now 


f(x) — mp = (x—a)fi(x) 


and applying induction as above, or alternatively (as Euler 
does) proceeding by successive divisions until all the roots 
have been exhausted, one gets the desired result. 

This theorem (made explicit, incidentally, only for the 
case f(x) = x" — 1, although the proof is quite generally 
valid) is then used to show the existence of primitive roots 
modulo any prime p; the general design of the proof is 
apparent, even though some clumsiness (whether due to 
Euler or to his assistants) tends to obscure the details. On 
the one hand Euler shows in substance that there is for 
each n a polynomial F,, of degree ¢(m) (which later came to 
be called “the cyclotomic polynomial” of that degree) such 
that x” — | is the product of the polynomials F(x) taken 
over al] the divisors d of n (including | and n); the roots of 
F,,, whether “real” or “imaginary”, are then precisely the 
primitive n-th roots of unity. The proof is by induction on 
the number of prime factors of n, distinct or not, even though 
it is carried out only up to three factors (giving occasion to 
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Gauss, in a fit of surliness, to claim that Euler’s result had 
been proved “only by induction”, i.e., in his use of the word, 
by experimentation); actually Euler obtains F,,(x) by dividing 
out x” — | by all the “cyclotomic polynomials” belonging 
to the divisors of m other than n. Thus, for each n, we may 
write 


— 1 = F,x)G,{x) 


where G,, is the product of the polynomials F, for d dividing 
n and <n. For the validity of Euler’s argument the fact 
would still be needed that the coefficients of F,, and G,, are 
integers; this is easily seen, since they are obtained by suc- 
cessive divisions by monic polynomials (i.e., polynomials 
with highest coefficient 1) with integral coefficients, but Euler 
fails to mention that point. The final step is then to take 
n = p — | and to note that the polynomial x’~' — 1, taken 
modulo p, has p — | distinct “real” roots 1, 2, ...,p—1, so 
that its divisor F,,_ (x) must itself have as many roots modulo 
p, from among the values |, 2, ..., p — 1, as its degree 
g(p — 1) indicates, these are the “primitive roots” modulo 
p. This argument amounts to saying that, over the field of 
integers modulo p (or indeed over any field), every divisor 
of a polynomial which splits into a product of linear factors 
must itself split into such factors; translated into Euler’s 
language, this is easily seen to hold, even though Euler does 
not make the point quite explicitly. 

As to the consequences which Euler draws from the ex- 
istence of primitive roots, in that same memoir and in the 
two companion ones presented on the same day to the 
Academy, some seem obvious to us and can indeed not have 
cost much effort to Euler; others concern theorems and 
conjectures related to the quadratic reciprocity law and will 
be discussed later. Noteworthy is Euler’s increasing awareness 
that in such matters he is dealing, not with individual integers, 
but with what we would call congruence-classes, i.e., in 
modern notation, with elements of the prime field F, 
Z/pZ, for this concept he introduces at first the word ordo 
(Eu.1-3.242), then the word species (Eu.1-3.519); for the group 
of quadratic residues modulo p, and for the set of non- 
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residues, each consisting of 3(p— 1) “species”, he has the word 
classis (residuorum resp. non-residuorum). Even more boldy, 
he introduces the notation a/8 for the congruence-class y 
defined by By = a, carefully explaining that no fraction is 
meant here, but the congruence-class with the representative 
(a + np)/B when a, 8 are representatives for their respective 
congruence-classes, and n is so chosen that (a + np)/B is 
an integer, as is always possible if B is prime to p (Eu.I- 
3.521). For the pair a, 1/a, he has the word sociata (Eu.I- 
3.524), or socia (Eu.I-3.248), or reciproca (Eu.1-3.503), and 
applies this concept to an elegant proof for the quadratic 
character of — 1 modulo a prime p = 2n + 1, as follows: 
observing that, if @ is a quadratic residue, then so is I/a, 
he concludes that the n quadratic residues can be arranged 
into pairs {e, a’} with e’ = 1/a, where a # a’ unless a” = 1, 
ie. unlessa = +1 (Eu.1-3.507,525); thus n is even or odd 
according as — 1 occurs among the quadratic residues or 
not. Of course it did not escape him that this can also be 
derived directly from the existence of primitive roots; 
indeed, if r is such a root, then, since p divides 


Pt — 1 = ("= 1)(r'+1) 


but not r” ~— 1, ” has the residue —1 modulo p; on the 
other hand, for any m, r” is or is not a quadratic residue 
according as m is even or odd; hence the desired result 
(Eu.I-3.255,260,262). In the same vein, when Euler received 
from Lagrange a paper containing two proofs for “Wilson’s 
theorem” (Lag. 11.425—438|1771; cf. Chap.IL, §VII), he sent 
him at once one of his own, based on the observation that, 
if r is a primitive root modulo the prime p = 2n + 1, then 


(p- Its rl ttt te-2) — ren N= ( 1)?r-} | (mod p) 


(EulV A-5.496]1773; cf. Eu.1-4.91—93 in E 560|1773. 


§VIL. 


The use of the words “real” and “imaginary” (or “im- 
possible”) in connection with solutions of congruences, in 
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Euler’s treatment of primitive roots modulo a prime (Eu.I- 
3.249, Scholion, and 252, Coroll.1, E 449|1772) should not 
pass here without comment. In the usage of the eighteenth 
century, and more specifically in Euler’s, the word “ima- 
ginary” appears with two different meanings; it denotes 
either what we would call a complex number, i.e. an element 
of the field R(V— 1), or else, more vaguely because the con- 
cept could not yet be made precise, an element of any al- 
gebraic extension of some groundfield when this extension 
could not be embedded in the field R of real numbers. The 
first concept, motivated by the work of the Italian algebraists 
of the sixteenth century on equations of degree 3 and 4, 
had been introduced by Bombelli in Book I of his Algebra 
of 1572, in close imitation of Euclid’s theory of irrationals 
of the form a + Vb in his Book X (“binomia” in the termi- 
nology of Campanus’s Latin translation of Euclid; accord- 
ingly, binomio is Bombelli’s word for a complex number 
a + V—}). Further developments belong to the history of 
algebra and analysis rather than to number theory; suffice 
it to say that Euler played a decisive role in extending to 
complex numbers the main operations of analysis. 

On the other hand, ina series of letters addressed in 1735 
to the Dantzig burgomaster C. L. G. EHLER for perusal 
by his fellow-citizen, the scientist H. KUHN (PAU.282-329, 
notably pp. 297-301 and 324-328), Euler stated his views 
about the successive extensions of the concept of number, 
first from ordinary to positive and negative integers, then 
to rational and to various kinds of (real) irrational numbers, 
and beyond them to algebraic extensions; at each step, in 
Euler’s view, all that is needed is that the usual algebraic 
rules should continue to hold and no contradiction should 
ensue; as to the last point, Euler is clearly satisfied with his 
own extensive experience and that of all competent math- 
ematicians. In this view, every polynomial must split into 
as many linear factors as its degree indicates; for instance, 
“I decide”, he says (“tam statuo”: loc.cit.p.325) that x* — 
ax’ — bx — ¢ consists of three factors x — p,x ~ q,x — 7. 
Having worked out to his satisfaction the example 
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x*> — 8 = 0, where two complex roots appear, Euler rests 
his case, and indeed Kuhn finds this so convincing that he 
withdraws all his objections. 

In the language of Euler and his contemporaries, the so- 
called “fundamental theorem of algebra” (no more and no 
less, in modern terms, than the fact that R(V—1) is alge- 
braically closed) could be expressed by saying that all “im- 
aginary” roots of algebraic equations can be written as 
a + bV—1; an equivalent statement, as Euler knew, is that 
every polynomial (with real coefficients) can be written as 
a product of factors of degree | or 2. The question acquired 
special importance in integral calculus because the inte- 
gration of rational functions by logarithms and inverse tri- 
gonometric functions depends upon it (cf. mfra, XV). In 
1742 Euler convinced himself that all rational functions can 
be so integrated; in 1703 Leibniz had expressed the opposite 
view (Math.Schr., ed. C. 1. Gerhardt (II) Bd.1.360); even in 
1742 Nicolas BERNOULLI, an older cousin of Euler’s friend 
Daniel, thought he could offer a counterexample and did 
succeed in shaking Euler’s confidence for a brief moment 
(Corr.1.170-171, cf. Corr.11.695 and L. Euler, Opera Postuma 
... Petropoli 1862 = Kraus Reprint 1969, pp. 525-539) 
before finally accepting Euler’s arguments. In spite of con- 
tributions by d’Alembert, Euler, Lagrange, Gauss, the matter 
cannot be said to have been fully understood until much 
later (until Cauchy and his successors, from the point of 
view of analysis; until Artin and Schreier, from the purely 
algebraic point of view); this story need not concern us 
here. 

In Euler’s memoir E 449|1772 on m-th power residues, 
however, the groundfield is implicitly the prime field F, = 
Z/pZ of integers modulo a prime p. Euler’s word for a solution 
of a congruence f(x) = 0 (mod ) is casus (“a case”) or also 
solutio, i.e. a solution for the problem of making f(x) divisible 
by p; if x = ais such, then he says that all the values a + 
mp are to be counted as one and the same “case” (“pro unico 
casu”: Eu.1-3.249, Scholion), All such “cases” are “real” if 
f(x) = x7’ — 1; therefore the same is true if f(x) is any 
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polynomial dividing x’~' — 1 (Eu.1-3.252, Coroll. 1); on the 
other hand some of them will be “impossible” and “so to 
say imaginary” (“quasi imaginarit”: Eu.J-3.249, Scholion) if 
f(x) = x” — land nis not a divisor of p — 1. Undoutedly 
what Euler has here in mind, in still undeveloped and obscure 
form, is the concept, introduced by Galois in 1830, of “Galois 
imaginaries” as they came to be called at one time, i.e. of 
the finite algebraic extensions of the field of integers modulo 
p, or in other words the finite fields. 


SVIII. 


All his life Euler was deeply concerned with the repre- 
sentation of integers, at first as sums of two squares, then 
by quadratic forms (he said “formulas”) X° + NY? or more 
generally uX” + vY° where N, pu, v are given integers, positive 
or negative. As to forms aX* + bXY + cY®, it is only in his 
old age, and then only under the influence of Lagrange, 
that he paid any attention to them (cf. infra, §1X), except 
that the form X? + XY + Y? had, unavoidably but fur- 
tively, made an appearance in connection with his work on 
cubic residues and the form X? + 3Y? (Eu.I-2.572—574 in 
E 272|1759). 

The multiplicative nature of such questions is evidenced 
by “Brahmagupta’s identity” (cf. above, Chap.I, VIII, and 
Chap.II, §XI1): 


(x* + Ny?)(z? + Ne?) = (xz + Nyt)? + N(xt¥yz)°. 


At least in special cases, Euler must have been aware of this 
identity quite early, perhaps in connection with his first 
attempts to deal with Pell’s equation and related problems 
(cf. Corr.1.36-37/1730, Eu.I-2.6-17 = E 29|1735, and infra, 
§XIII) long before he found it necessary to spell it out in 
the case N = — 2 (Eu.1-2.237 in E 167|1748); for the simplest 
case N = I, cf. Corr.1.134|1742, and Euler’s first memoir 
on sums of two squares where it is still mentioned as “note- 
worthy” (“notatu dignum”: Eu.1-2.300 in E 228/1749). The 
general case, implicit in a letter from Goldbach 
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(Corr.1.612]1753) and surely well known to Euler by that 
time, is stated in the latter’s letter of August 1755 (Corr.1.629) 
and then again, as a theorem of capital importance (“theorema 
eximium”: Eu.1-2.6G60 in E 279|1758) in a memoir on inde- 
terminate equations, where it leads at once to the conclusion 
that the representation of integers by a form X? + NY? is 
a multiplicative problem and should first of all be studied 
in the case of a prime (2bd.,p.601). 

As Euler knew, Fermat had treated the cases N = 1, 2, 
3 and had announced his results in letter XLVII of the 
Commercium of 1658 (loc.cit. above, §IV; cf. Chap.II, 88 VIII 
and XII). But, when Euler started taking up those problems 
seriously, he had long been aware of Fermat’s earlier state- 
ment that a sum a” + 6°, with a prime to ), has no prime 
divisor p = 4n — | (Fe.11.204 = Varia Opera, p.161; cf. 
Chap.II, §VII), nor consequently any divisor 4n — 1, be it 
prime or not. Obviously this had made a deep impression 
on his mind; quite early (“vor langer Zeit: Corr.1.107|1741; 
cf. Corr.I.114|1742) he had transformed it into the statement 
that neither 4mn — m — 1 nor 4mn — m — ncan bea square 
if m, n are positive integers; indeed, if 4mn — m — 1 (resp. 
4mn ~— m ~ n) were a square a’, this would imply 


m(4n—1) = 1 + a’ resp. (4m—1)(4n-1) = 1 + 4a’, 


so that 4n — I would divide a sum of two mutually prime 
squares. 

Thus Euler was confronted not with one but with three 
types of questions, which may be formulated thus, for any 
given integer N (positive or negative, but other than 0 and 
—1): 


(i) Which are the primes that can be written as a” + Nb’, a and 
b being integers? 

(it) What are the “prime divisors” of the form X° + NY”, if by 
this one understands with Euler the odd prime divisors 
(necessarily prime to N) of some integer a? + Nb? with 
a prime to Nb? 

(iii) Which odd integers are divisors of some integer a? + Nb® 

with a prime to Nb? 
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As Euler must have realized rather soon, the integers 
(prime to 2N) that satisfy (i11) are precisely those modulo 
which —N is a quadratic residue (cf. Eu.I-2.209, Annot.9, 
and 219, Annot.18, in E 164/1744; cf. also no.300 of the 
Tractatus, Eu.1-5.229—230). In fact, if a? + Nb? = mn with 
a prime to Nb, a and Nb must be prime to m; then there is 
b’ such that bb’ = 1 (mod m) (cf. above, Va) and we have 
— N = (ab')? (mod m). The converse is immediate. If N = 1, 
for instance, this shows that an answer to (ii) must amount 
to the determination of the quadratic residue character 
of —1 modulo any odd prime (cf. above, §VI, and Chap.I, 
§VII). If for convenience we introduce the Legendre symbol 
(n/p), defined whenever p is an odd prime and nan integer 
prime to p, and equal to +1 or —1 according as n is a 
quadratic residue modulo p or not (cf. Chap. IV, §V}), then 
(ii) amounts to asking for the determination of (— N/p) for 
all primes p, not dividing 2N. On the other hand, from the 
point of view of the theory of quadratic fields, (1) and (ii) 
amount to finding the behavior of a rational prime p in the 
field Q(V—N): if it divides a® + Nb’ with a prime to Nb, 
then, in that field, it splits into the ideal prime factors 
(p,a+ bV —N); it will satisfy (i) if those factors are the principal 
ideals (a+ bV —N). Thus (i) lies deeper than (ii); to use Euler’s 
words for this, it is “altioris indaginis” (Eu.1-2.603 in 
E 279|1758); it depends upon the theory of ideal-classes in 
quadratic fields, or equivalently of classes of binary quadratic 
forms, while (ii), as will be seen below (cf. Appendix J), 
depends upon no more than the law of quadratic reciprocity. 

Throughout the years Euler was tireless in exploring all 
aspects of the above questions and bringing together what- 
ever bits of information he could gather about them. Already 
in 1741 we find him reporting to Goldbach the “curteuse 
proprietates” he has discovered concerning the prime divisors 
of X° — 2¥?, X® — 3Y?, X? — 5Y? (Corr.1.107/9 Sept.1741); 
in the latter case this already went beyond anything that 
Fermat had stated. The next year the “cuneuse proprietates” 
had blossomed forth into “a very pretty arrangement” (“eine 
sehr artige Ordnung”: Corr.1.146|28 Aug.1742) which he at 
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once started to describe on the basis of his experiments, 
while predicting that this was only the prelude to further 
“splendid properties” (“herrliche proprietates”: ibid., p.150) 
yet to be discovered. Not later than 1744 (“vor einiger Zeit”: 
Corr.1.279\4 July 1744) he put together all his empirical 
observations on the prime divisors of X* + NY® (covering 
16 positive values and 18 negative values for N), added 
theoretic conclusions in the form of a series of Annotationes, 
and sent the whole to Petersburg. This is the memoir E 164 
(= Eu.1-2.194-222) which the Academy published in 1751. 
From now on we shall assume N to be square-free; this 
does not restrict the generality, so far as (ii) and (iii) are 
concerned. In modern terminology and notation, the answer 
to (i1) is best described by means of the “Dirichlet character” 
attached to the field Q(V —N). This is a certain function w 
of integers, taking the values + | and 0, periodic of period 
D = |N| if N = —1 (mod 4) and D = 4{N| otherwise; it is 
such that a prime Pp satisfies (11) if and only if w(p) = 1; in 
other words, one has, for all primes p not dividing 2N: 


-N 
op = (=), 


A construction and full characterization of » will be given 
in Appendix I. 

Let Ky be the set of congruence classes prime to 4/N| 
modulo 4|N| on which @ takes the value 1; thus a prime 
satisfies (ii) if and only if its congruence class modulo 4|N| 
belongs to Ky. As will be shown in Appendix I, Ky can be 
characterized by the following properties: 


(A) Ky isa subgroup of index 2 of the multiplicative group modulo 
4|N|. This implies that it contains all quadratic residues 
prime to 4|N| modulo 4|N|. 

(B) —1 belongs to Ky or not, according as N <OorN > 0. 

(C) Ky admits the period D and no period D' < D. By this is 
meant that, ifr ands are prime to 4|N andr = s(mod D), 
then both belong to Ky whenever one of them does, 
and that no divisor D’ of D, smaller than D, has the 
same property. 
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This characterization of Ky may thus be regarded as pro- 
viding a complete answer to question (ii). 

Euler’s first major discovery, on the basis of his experi- 
mental data, was to observe that the prime divisors of 
X? + NY? are precisely those primes which belong to a 
certain number of arithmetic progressions 


P(r) = {7,4|N|+7,8 


N| +r,etc.} 


where r is prime to 4|N| and 0 < r < 4|N| (loc.cit., Annot.3, 
13; cf. Corr.1.147/1742). 

Here it is well to keep in mind Dirichlet’s theorem ac- 
cording to which every progression such as P(r) contains 
infinitely many primes; were it not so, Euler’s experimen- 
tation could hardly have been carried out so successfully. 
Because of this, the set of congruence classes modulo 4|N| 
determined by the integers 7 for which P(r) has the property 
in question is no other than the set Ky introduced above. 

At first, writes Euler to Goldbach, there seems to be no 
regularity (“keine Ordnung”: Corr.I.147) in the numbers 
making up the set Ky; nevertheless they proceed according 
to “a beautiful law” (“nach emer schénen lege”, as he puts it 
in his mixed German and Latin: ibid.). In the first place, all 
powers and products of powers of elements of Ky belong 
to Ky (Corr.1.148,150; Annot.8,16); in our language, this 
means that Ky is a subgroup of the multiplicative group 
modulo 4|N|. Since obviously every integer satisfying (iii) is 
a product of primes or powers of primes with the same 
property, this implies the important fact that also all such 
integers must belong to some progression P(r) with rin Ky 
(cf. loc.cit. pp.220-—222, Schol.2,3); perhaps it was this that 
led Euler to the discovery of the group property for Ky. 
The same remark shows in particular that all odd integers 
a’ + Nb’ with a prime to Nb must belong to some P(r) with 
rin Ky; for 6 = 2 and a prime to 2N, it shows that all 
quadratic residues modulo 4|N| belong to Ky (Annot.7, 16). 
Furthermore, if r belongs to Ky, —r belongs to it if N < 0 
but not if N > 0 (Annot. 3, 13); and the number of elements 
of Ky is half the number of congruence classes prime to 
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4|N| modulo 4|N| (Annot.5, 14) i.e. $ g(4|N)) if ¢ is the “Euler 
function”. Thus Euler had found that Ky has the properties 
(A) and (B) listed above. 

As to (C), Euler tells us that, if r belongs to Ky, so does 
r + 2\N| for N = —1 (mod 4) (Annot.11, 19), while this is 
not so otherwise (Annot.12, 20). This means, in the former 
case, that Ky admits the period ||; in fact, if r and s are 
prime to 4/N|, they are both odd, so that, ifr = s (mod |N)\), 
smust be =ror =r + 2|N| (mod 4|N)). In view of this, and 
of Euler’s specific mention of the fact that this is not so for 
N = 1 or 2 (mod 4), we may surmise that he had indeed 
looked for other periods; if we credit him with having per- 
ceived that there are none, we may thus conclude that he 
was in possession (conjecturally of course) of the full char- 
acterization of the set Ky and thus of a complete answer to 
question (ii). 

With this Euler had wandered far ahead of the paths 
trodden by Fermat. Hidden in his guesses, as perhaps the 
most valuable nugget of his treasure-trove, lay the law of 
quadratic reciprocity (cf. Appendix I, and above, §V2), which 
he was not to recognize as such until quite late in life. On 
one occasion he did push his experiments even further; 
while composing for the Tractatus two highly tentative chap- 
ters on cubic and biquadratic residues, he looked for con- 
ditions for 2, 3, 5, 6, 7, 10 to be cubic residues and for 2, 
3, 5 to be biquadratic residues modulo a prime p (Eu.I- 
5.250-251, nos.407-410, and 258-259, nos.456—457). As 
to the latter question, it arises only for p= 1 (mod 4), since 
otherwise, as Euler notes (ebzd.p.252, no.419), every quadratic 
residue is a biquadratic residue; thus Euler can write p = 
a’ + 46? and find that the biquadratic character of 2, 3 resp. 
5 modulo p depends only upon the values of a and 6 modulo 
4, modulo 6 and modulo 5, respectively. Similarly, for cubic 
residues, it is enough to consider the case when p can be 
written as p = a’ + 3b°, and then the cubic residue character 
of 2, 3, 5, 7 resp. 10 depends only upon the values of a and 
b modulo 3, modulo 9, modulo 15, modulo 9, modulo 21 
and modulo 5, respectively, ina manner correctly described 
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by Euler. These brilliant but wholly isolated observations 
were lost to the mathematical world; in 1849, when the 
Tractatus was first published, Gauss, and then Jacobi and 
Eisenstein, had already proved all that Euler had guessed, 
and of course much more (cf. Eu.I-5, pp.XXI-XXIV). 


SIX. 


In the years 1741 to 1744, while assembling experimental 
data on the prime divisors of the forms X* + NY?®, Euler 
sought to use his conclusions so as to generalize his former 
remark about integers 4mn — m — n not being squares. 
For instance, as he writes to Goldbach (Corr.I.162|27 Oct. 
1742), since his observations imply that the form X*? + NY’, 
for N > 0, has no divisor r = 4Nx — 1, an integer 
4Nmn — m — n, for positive m and n, can never be a square; 
in fact, if it were equal to a’, this would give 


(4Nm—1)(4Nn—-1) = 1 + 4Na?. 


Perhaps, he adds, the latter statement will be easier to prove 
than the former one (ibid.; cf. Corr.1.179|5 Jan.1743). Almost 
no letter passed between the two friends, from 1741 to 
1744, without this topic being somehow touched upon; 
even in his memoir of 1744 he included at the end a list of 
“formulae” 4Nmn — A(m+n) whose value, in view of his 
observations, can never be a square for m and n positive 
and prime to A (Eu.I-2.220—222; cf. Corr.1.260,278). Even- 
tually he must have given up his hopes of finding direct 
proofs for such statements. It is amusing, however, to observe 
Goldbach’s obstinate efforts to prove the simplest one of 
them, the one relating to 4mn — m — n, even though this 
was unnecessary in view of Euler’s proof (sent to Goldbach 
on 6 March 1742, and repeated on 19 January 1743: 
Corr.1.115—117,191—192) that X° + Y? has no prime divisor 
p = 4n — 1. With some prodding and some help from 
Euler, Goldbach was at last successful (Corr.1.255—256|28 
Sept. 1743; cf. Euler’s answer, pp. 258~259|15 Oct.1743); 
his proof, perhaps his only contribution of value in the 
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whole correspondence, is by descent; had he and Euler but 
known it, it contains the first germ of the Lagrangian re- 
duction theory for binary quadratic forms (cf. below). 

Except for this episode which had led into a blind alley, 
Euler, for many years, concentrated his efforts upon the 
proof of Fermat’s statements about the “elementary forms” 
X? + NY? for N = 1, +2, 3, corresponding to fields 
Q(V —N) with a euclidean algorithm (cf. Chap.I], §§VIII, 
XI and Appendix I); in those fields integers can be decom- 
posed into primes, just as in ordinary arithmetic, so that 
every prime divisor of such a form X* + NY? can be re- 
presented by it; in other words, the answer to question (i) 
of §VIII is the same as for (ii). Those facts had been stated 
and presumably proved by Fermat; Euler had to reconstruct 
the proofs, and it took him many years. 

His first step in this direction had been to deal with the 
easier part of (ii) for the form X* + Y°, by proving that no 
prime p = 4n — | can bea prime divisor of that form, or 
in other words that (— I/p) = — 1 for such a prime; this 
had been an immediate consequence of Fermat’s theorem 
a’! = 1 (mod p) (Corr.1.115~117|1742; cf. Chap.II, §VI). 
Decisive progress was made with question (i), also for N = 1, 
in 1747; in a letter to Goldbach (Corr.1.416—-417) Euler 
describes his newly found proof; it is the one which has 
been reproduced in Chap.II, §VIII; the crucial lemma? 
(lemma 2 of Chap.II, VI] appears in that letter as Theo- 
rem 2. Thus it was proved that every prime divisor of 
X* + Y* can be written as a” + b° and is consequently of 
the form p = 4n + 1. 

It still remained to be shown that every prime p = 4n + 1] 
is a divisor of X* + Y?, ie. that (- 1/p) = 1 for such a 
prime; as we have seen, this was accomplished in 1749 (cf. 
above, §§Vb and VI). As to the letter of 1747, its contents 


° Undoubtedly the more general statement (Corr.1.134|1742) that “if 
ab and a are sums of two squares, so is 6” had been premature, in spite 
of Euler’s assertion that he could prove it quite rigorously (“rigidissime”). 
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went, in part almost word for word, into Euler’s first memoir 
on sums of two squares (Eu.I-2.295-327 = E 228|1749), 
including unfortunately a rather negligent and imperfect 
presentation of the final part of the argument. This was 
followed in 1753 by a very lucid treatment, by the same 
method, of the form X° + 2Y° (Eu.1-2.467—485 in E 256|1753), 
and in 1759 by its extension to X* + 3Y? (Eu.I1-2.556-575 
= E 272|1759; cf. Eu.I-2.481—482 in E 256|1753); the proofs 
for these cases are in substance those described in Chap.I], 
§XII. 

It had thus been proved that every prime divisor of 
X? + NY®, for N = 2 or 3, can be written as a? + Nb® and 
thus must be = 1 or 3 (mod 8) for N = 2, and = 1 (mod 3) 
for N = 3; Euler had still to prove the converse. For N = 3, 
this was done in 1759 (Eu.I-2.573 in E 272|1759, Prop.9); 
the proof, which has been reproduced in Chap.II, §XII, is 
based on the identity 


0" P= (x= 1)" 4x" 4+ 1). 


In the Scholion to that proof (dbid.,p.575), Euler mentions 
that he knows also how to prove that a prime p = 8n + 1 
is a divisor of X? + 2Y? (but not how to prove the similar 
result for p = 8n + 3); undoubtedly he alludes there to 
the proof published in 1772, and reproduced above in 
Chap.II, §XII, this being based on the identity 


x8 —je= (x" — L)[(x?" - 1)? + 2x?" 


(Eu.1-3.274-275 in E 449|1772, no.85). In this last paper, 
however, he can go further, thanks to “a communication 
from a friend” (perhaps Lexell); this friend, in fact, merely 
pointed out that Euler’s argument for proving that every 
prime divisor of X° + 2Y* can be written as a> + 2b? is 
equally valid for X° — 2Y° (ibid., p.275; cf. Eu.1-3.183 
in E 427|1772). Therefore a prime p = 8n + 3 cannot 
be a divisor of X? — 2Y?, so that, for such a prime, we have 
(2/p) = -1; as we have p = 3 (mod 4), which implies 
(— I/p) = — 1, we get (— 2/p) = 1, as was to be proved (cf. 
Chap.II, §XII). This completes the determination of (+ 2/p) 
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for all odd primes, since it shows that (—2/p) is | if and 
only if p = 1 or 3 (mod 8) and since (— 1/p) is known. In- 
cidentally it is curious to note that in a paper presented to 
the Petersburg Academy on the very same day as the one 
just mentioned (Eu.1-3.511 in E 552|1772, Scholion 3; cf. 
also Eu. 1-4.38 in E 557|1773, nos.24—25) Euler, or whoever 
was writing for him, appears to be unaware that the result 
(—2/p) = 1, for p = 8n + 3, has just been proved. By that 
time, perhaps, Euler’s papers were sometimes little more 
than transcripts of his conversations with his assistants. 
Thus, in 1772, Euler had fully caught up with Fermat in 
the matter of the prime divisors of the forms X? + Y?, 
X* + 2Y?, X? + 3Y”, and of the representation of integers 
by those forms. At one point he went beyond Fermat by 
making an observation which attracted Lagrange’s attention 
and then was widely generalized by Gauss (cf. infra, Chap.IV, 
§VI). Gauss proved that, if p is an odd prime, V+ is con- 
tained in the field of p-th roots of unity, the sign being 
determined so that + p = | (mod 4). This implies that the 
“cyclotomic polynomial” X*~' + X?~? + + + 1 splits over 
the field OW =p), so that there are two homogeneous poly- 
nomials F(X, Y), G(X, Y) of degree 3(p— 1), with integral 
coefficients, satisfying an identity , 


A(XP>! 4. XP 2p pF VPO!) = F(X, VY)? — (+p)G(X, ¥)?. 


For p = 3, for instance, one can take F = 2X +Y,G = Y, 
and this was a main ingredient in Euler’s treatment of 
X® + 3Y*. Similarly, if also g = pn + lisa prime, the above 
identity, for X = a", Y = b", gives: 


4(a’) —B-!) = (a"—b)[F(a", b")?  pG(a", b”)"], 


which, by the same argument as in the case p = 3, shows 
that +p is a quadratic residue modulo q. Euler carried this 
out for p = 5 and p = 7 (Eu.1-3.280-281 in E 449|1772, 
nos.96—97); so did Lagrange a little later, but he failed in 
his attempt to extend it to p = 11 (Lag. 1H1.789-792\1775). 

As to the representation of primes by forms X? + NY? 
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(question (i) of §VIII) for other values of N than those 
considered by Fermat, Euler had for a long time to remain 
content with the experimental results he had inserted in 
his memoir of 1744. Already in 1742, after communicating 
to Goldbach the “herrliche proprietates” described above in 
§VIII, he had touched upon that question, announcing rashly 
that every prime divisor of X* + NY* can be represented 
by that form and giving as examples the cases N = 1, 2, 3, 
7 (for which it is true) and N = 5, 6, for which it is false 
(Corr.1.146—148). In 1743 he still thought that if a prime p 
is a quadratic residue modulo 4N, for N > 0, then it can be 
written as a” + Nb? (Corr.1.264). In his memoir of 1744 he 
included a list of experimental findings concerning 12 pos- 
itive values of N, and the values N = —2, —3; as typical 
examples we quote: 

1° If p is a prime divisor of X* + 5Y*, it can be written 
as a” + 5b” for p = 1 or 9 (mod 20); 2p can be so written 
for p = 3 or 7 (mod 20) (Eu.I-2.196, Theor.11). 

2° All prime divisors of X* + 7Y* can be written as 
a’ + 70° (ibid.,p.197, Theor. 14). 

3° If pis a prime divisor of X* + 17Y?, either p or 9p can 
be written as a* + 176° when pisa quadratic residue modulo 
4 x 17 = 68; otherwise 3 can be so written (2bid.,p.199, 
Theor.23). 

4° If p is a prime divisor of X* + 14Y%, it can be written 
either as a® + 146° or as 2a? + 7b whenitis = + 1 (mod 8); 
otherwise 3p can be so written (2bzd.,p.203, Theor. 35). 

He must have found such phenomena puzzling; even ten 
years later, in a memoir significantly entitled “Specimen de 
usu observationum in pura mathematica” (“On the use of ob- 
servations in pure mathematics”: Eu.J-2.459-492 = 
E 256|1753) and dedicated largely to the form X* + 2Y?, 
he repeated some of his earlier statements, commenting 
that “they seem certain but will surely be quite hard to prove” 
(loc.cit.p.486). In that same year he was pointing out to 
Goldbach that a prime p = 4Nx + | is not necessarily 
representable by the form X* + NY® “in integris”, i.e. as 
a’ + Nb’, a and b being integers; nevertheless he still felt 
sure (and rightly so; cf. Chap.IV, Appendix I) that it can 


Euler 215 


always be so represented rationally (“in fractis”: Corr.1.605— 
606]1753); in other words, the equation p = x° + Ny’ must 
have a rational solution (x, y), even if it has none in integers. 
In 1772 he had a memoir presented to the Academy (Eu.1- 
4.1-24 = E 556) on the integers which can be represented 
rationally by a quadratic form ¢X* + vY”; little is new there, 
apart perhaps from some formulas containing special cases 
of the composition of quadratic forms (p.2, Theor.1, and 
p.4, Theor.2; cf. Chap.IV, $VI), not to mention a confused 
and confusing variant (pp. 18—24) of Lagrange’s method of 
solution for equations ax” + by? + cz> = 0 (cf. Chap.II, 
§XIV, and Chap.1V, $11); amazingly Lagrange is not men- 
tioned there, although his paper (Lag.11.377-535|1768) had 
for some time been in the hands of Euler, who had com- 
mented upon it in a letter to Lagrange (Eu.IV A-5.466|16 
Jan.1770). 

Except for this puzzling item, there is no sign of Euler’s 
having made any progress with these matters until he re- 
ceived from Lagrange in 1775, along with a volume of the 
Turin Miscellanea, the volume of the Berlin Mémoires con- 
taining Lagrange’s Recherches d’Anithmétique (= Lag.111.695— 
758|1775). On sending this to Euler, Lagrange, with his 
usual lack of self-assurance, has expressed how eagerly he 
was awaiting Euler’s verdict on his latest work: 

“Si vos occupations et Vetat de votre santé vous ont permis de 
jetter les yeux sur le peu que j'ai donné dans les Memorres de Berlin 
et de Turin, je vous supplie de vouloir bien m’en dire votre 
avis; ma plus grande ambition a toujours été d’obtenir votre suf- 
frage ...” [“If your occupations and your state of health 
have let you glance at the little I have contributed to these 
volumes, I entreat you to give me your opinion on them; 
my highest ambition has always been to win your approval 
.. 7: EulV A-5.503—504|10 Feb.1775]. 

It would be wrong to tax him with insincerity, or even to 
suspect him of fishing for compliments (cf. Chap.IV, §II). 
Anyway, Euler greeted that work with even more than his 
customary enthusiasm: 

“Il est bien glorieux pour mov”, he wrote, “d’avoir pour successeur 
a Berlin le plus sublime geométre de ce siecle .. . J'ai parcouru 
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avec la plus grande avidité les excellens mémoires dont vous avés 
enrichi les derniers volumes de Berlin et de Turin...” [“It is 
most flattering to me to have as my successor in Berlin the 
most outstanding mathematician of this century... I have 
avidly gone through your excellent contributions to the re- 
cent volumes...”: Eu.IV A-5.504 = Lag.X1V.241|23 March 
1775]. In that letter he makes no specific mention of the 
Recherches, but within a few months he had composed, under 
the title “De msigni promotione scientiae numerorum” (“On some 
famous progress in the science of numbers”: Eu.I-4.163— 
196 = E 598|1775) a kind of commentary on that memoir, 
which he presented to the Academy on 26 October 1775; 
it appeared posthumously in 1785. 

Not unlike many great discoveries, Lagrange’s basic idea 
was simplicity itself; he had already described it briefly in 
his Additions to Euler’s Algebra (Lag. VI1.125-127 = Eu.I- 
1.603—605, no.70); but at that time Euler, while acknow- 
ledging (rather perfunctorily: Eu.IV A-5.496 = 
Lag.X1V.235|1773) receipt of that volume, had failed to 
take notice of this brilliant idea. In the Recherches of 1775 
it is carried out in 4 pages (/oc.cit.pp.697—700); for the present 
the following will suffice. If — N is a quadratic residue modulo 
an integer e (not necessarily a prime), then, writing —N = 
f? — eg, we see that —4N is the so-called “discriminant” of 
the quadratic form F(X, Y) = eX? + 2fXY + gY’, while 
F(1, 0) = e (in general the discriminant of a form aX* + 
bXY + cY® is defined as 5 = b* — 4ac). “It may seem that 
nothing much is gained by this”, writes Euler (“parum hinc 
lucri ad nostrum institutum afferrt videtur”: loc.cit.p.165); 
“and so it would be, were it not for the illustrious Lagrange’s 
discovery that the infinitely many forms” (“infinitam . . . 
formularum multitudinem”) with the discriminant 6 can, for 
each value of 6, be reduced to “a small number” (“ad exiguum 
numerum’’). In fact, consider a form 


F(X, Y) = aX® + bXY + cY’ 


with the discriminant § = 6? — 4ac; here we merely assume 
that 6 is neither 0 nor a square; in particular, we assume 
ac + 0. Suppose that either |a| or |c| is < |b]; then, by sub- 
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stituting X — r¥ for X if |6| > al (resp. Y — rX for Y if 
|b| > |cl) and choosing r suitably, one can obtain a form 


PYX Y) = a,X* + b,XY + oY? 


with the same discriminant 8, but where |b,| < |a| < |b| (resp. 
|b,| < |c| < jbl), so that in any case [b,| < |b. Obviously F and 
F, are “equivalent” in the sense that they “represent properly” 
the same integers; this means that the set of values taken 
by F(a, 8) for a prime to B is the same as the similar set for 
F, (for more details, cf. Chap.IV, §I1V). Clearly the process 
of “Lagrangian reduction” by which F; has been derived 
from F can be repeated, but no more than a finite number 
of times since |b| decreases at every step, until one reaches 
a form 


(X, ¥) = pX® + gX¥ + 1rY®, 


still equivalent to F and still with the same discriminant 
8 = q° — 4pr, for which |p| and |r| are both =jq|. An easy 
calculation shows that q° < 46| if 5 < 0, q <5 if § > 0; as 
4pr = g° — 4, this implies that for a given 6 there are only 
finitely many such forms @. Moreover, as a = F(1, 0), we 
must have a = (a, B) with a prime to 8, and therefore 


4pa = 4p(pa® + qaB+rp?) = (2pa+qB) — 6B. 
If, for a given 4, one lists all the forms 
b(X, Y) = p:X? + qX¥ + n¥? (i =1,2,...,m) 


satisfying the above conditions for ¢, this shows that one 
of the integers 4p,a can be represented by the form X* — 
5Y*. The question still arises whether all the forms ¢, are 
needed for this purpose; this point had been examined in 
full detail by Lagrange (loc.cit.pp.723—740; cf. Chap.IV, 
8IV and App.II, III); Euler fails to mention it; either he 
found it less interesting, or else Lagrange’s rather laborious 
calculations for solving that question were more than the 
blind Euler could follow. 

In particular, take N squarefree as before, and let again 
e be a divisor of X* + NY?, i.e. a positive integer such that 
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—N is a quadratic residue modulo e. Put ~N = f? — eg 
and apply the foregoing process of “Lagrangian reduction” 
to the form 

F(X, Y) = eX? + 2fXY + g¥? 


with the discriminant = —4N; ¢ being as above, we have 
g — 4pr = —4N, so that q is even. Writing e = $(a, B), we 
get now 


» 
pe = (ne +26 + NB. 


Thus, if we call again 4, ..., @,, the forms such as ¢ for the 
discriminant 6 = —4N, one of the integers pe will be rep- 
resentable by the form X* + NY?®. Take for instance N = 
1; then there is no other form @ than ¢, = X? + Y? (dis- 
regarding — @, which takes no positive values); therefore 
every divisor of that form is representable by it, and in 
particular every odd divisor of X* + Y* is = 1 (mod 4). In 
substance, this is nothing else than the proof that Goldbach 
had so laboriously concocted in 1743. 

As this example suggests, many of Euler’s old conjectures 
could now be deduced from the above results; some of 
them are treated in Lagrange’s Recherches; more are added 
in a continuation of that memoir published in 1777 
(Lag. 111.759-—795|1775). As a further illustration we may 
quote the case N = 5,8 = — 20. Here we have the forms 
ob, = X? + 5Y®, dy = 2X” + QXY + 3Y”*; therefore, if p is 
an odd prime divisor of X* + 5Y’, either it can be written 
as p = a? + 50°, in which case p = 1 or 9 (mod 20), or 2p 
can be so written, in which case p = 3 or 7 (mod 20). This 
argument fails to show the converse, which had been the 
other part of Euler’s findings about that form, viz., that 
every prime p = 1, 3, 7, 9 (mod 20) is a divisor of X? + 
5Y*. Nevertheless Euler was delighted and took great pleas- 
ure in adding several examples to Lagrange’s, or rather, 
perhaps, in having them worked out by his assistants. 

What was still beyond the scope of Lagrange’s new methods 
was essentially the law of quadratic reciprocity. In his old 
age Euler succeeded at least in disengaging it from his early 
conjectures; it makes up the conclusion of a paper of 1772 
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on quadratic residues (Eu.1-3.510—512 in E 552|1772). His 
formulation for it, in terms of two odd primes p and s, 
amounts to writing the relation 


G) = 4 


where the sign is taken so that +p = 1 (mod 4) (cf. Appen- 
dix I). In1773 he again attracted Lagrange’s attention on 
such matters: “Je suas fort assuré que la considération de 
ces circonstances pourra conduire a des découvertes trés impor- 
tantes”, he wrote or rather dictated (“I am sure that this will 
lead to very important discoveries”: Eu.I[V A-5.498 
= Lag.X1V.238|1773). In his memoir of 1775 described 
above, after dealing with Lagrange’s reduction theory, he 
pointed out how this confirmed in part his former guesses 
and concluded: “Thus all the “theorems” which I formulated 
long ago in vol.XIV of the old [Petersburg] Commentaru 
have acquired a much higher degree of certainty .. . and 
there seems to be no doubt that whatever in them is still to 
be desired will soon receive a perfect proof” [“Nunc igitur 
omnia theoremata, quae ... olim in Comment.veter. Tomo X1V 
dederam, multo matorem gradum certitudinis sunt adepta .. . atque 
nullum dubium esse videtur, quin mox, quod in hoc genere adhuc 
desideratur, perfecta demonstratione muniatur”: Eu.1-4.195 in 
E 598|1775]. 

He was not far wrong. But mathematics in those days was 
moving ahead at a more leisurely pace than it does now. 
Proving the law of quadratic reciprocity still took more than 
twenty years; and it took Gauss to do it. 


§X. 


Fermat had stated, not only that every prime p = 4m + 1 
is a sum of two squares, but also that it can be written 
thus in only one way; and a converse to this can easily be 
extracted from his further statements on this subject (cf. 
Chap.II, §§VIII and IX). Thus those results contain implicitly 
a criterion for an integer 4m + | to be a prime. 
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Euler became aware of this long before he could prove 
it. In 1742 he wrote to Goldbach that 4m + 1 is prime if 
and only if it can be written as a® + 5? in one and only one 
way (“unico modo”: Corr.1.134—135), illustrating this by the 
example of the “Fermat number” 9” + 1 (nota prime, as 
he knew since 1732; cf. §I1V) which can indeed be written 
as 


(21°)? + 1 = 62264° + 20449”. 


In his first memoir on sums of two squares, Euler qualified 
his earlier assertion by saying that 4m + | is prime “if and 
only if it can be written in one and only one way asa” + 6? 
with a prime to 6” (Eu.I-2.314 in E 228|1749). Thus, if we 
again call a representation n = a* + b* of an integer n by 
a sum of two squares a proper one if a is prime to 6 (cf. 
above, Chap.II, §1X), Euler appears to say that 4m + 1 is 
a prime if it has one and only one proper representation 
as such a sum. Neither one of his formulations is exact; as 
he must surely have known, 45 = 36 + 9, 25 = 16 + 9, 
125 = 121 + 4 offer counterexamples. In the fashion of 
his time he must have regarded such exceptions as “proving 
the rule”, or rather as occurring only in cases too obvious 
to deserve mention. To be quite precise, he should have 
said that an integer 4m + 1, with m > 0, is prime if and 
only if it has one and only one representation as a sum of 
two squares, and that representation is proper; in this for- 
mulation one must regard a? + 0° as a representation of 
a’, but not a proper one. 

In that memoir, Euler applied his criterion to 82421, 
100981, 262657 which are prime, and 1000009, 233033, 
32129 which are not (loc.cit. pp.32 1-327); here he notes that 
32129 has the unique representation 95° + 152° (but this 
is not proper), while in the case of 233033, which is not a 
sum of two squares, he was using a result for which he had 
no proof as yet, viz., that every prime p = 4m + 1isasum 
of two squares; as we have seen, this was proved shortly 
after that paper was written (cf. above, §§Vb and IX). 

Of course, given a large number n = 4m + 1, it is im- 
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practical to search for its representations n = a* + 6° by 
mere trial and error; shortcuts have to be devised. This is 
done in two papers (Eu.I-3.1-45 = E 283]1760, Eu.I-3.112—- 
130 = E 369|1765) devoted chiefly to the search for large 
primes by means of the above criterion; Euler’s idea is to 
use congruences. For instance, if n = 2 (mod 3), a and b 
must be prime to 3; if m=2 (mod 5), a and b must be = +1 
(mod 5); if n = 5 (mod 8), a or b must be = 2 (mod 4); etc. 
Using such remarks, Euler is able, in many cases, to reduce 
the amount of trials to manageable size. 

In particular, if n = a* + 5° is odd, one of the numbers 
a, b must be odd and the other even, so that one can write 
n = a’ + 4c°. Consequently the primality criterion applies 
equally well to the form X* + 4Y?, provided we agree to 
call proper a representation n = ya® + vb* of an integer n 
by the form 4X? + v¥? if and only if a is prime to vb and 
bis prime to pa. Similarly, assume that n = a? + (36) with 
a prime to 3b; then n is = 1 (mod 3); if at the same time we 
have n = c? + d’, either c or d must be = 0 (mod 3); from 
this it follows that the primality criterion is also valid for 
the form X? + 9Y*. Using congruences modulo 8 and modulo 
5, one finds in the same way that it applies also to X° + 
16Y? and to X*? + 25Y?. In part, these observations are 
implicit in the paper of 1765 quoted above. 

Eventually Euler found that the reach of his criterion 
extends well beyond the form X* + Y?. One main point in 
that criterion was that no prime can be a sum of two squares 
in two different ways; this fact depends upon the result 
which appears as Theorem 2 in Euler’s letter of 1747 to 
Goldbach (Corr.I.416), as Proposition 1 in his memoir of 
1749 (Eu.1-2.302), and which has been reproduced above 
as lemma 2 of Chap.IJ, §VIII. As Euler discovered when 
he directed his attention to the forms X? + 2Y”, X? + 3Y°, 
most of the theory of the form X* + ¥’, including that theo- 
rem and its proof, can be rather easily extended to those 
two forms (Eu.I-2.459-492 = E 256|1753, Eu.1-2.556— 
575 = E 272|1759; cf. above, §1X, and Chap.II, §XII). In 
particular, they both give rise to a primality criterion similar 
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to the one discussed above; Euler used this, for instance, 
for showing that 67579 is prime and that 40081 is not (Eu.I- 
2.490—491). 

At some point Euler must have become aware that the 
proof for the crucial result quoted above (lemma 2 of 
Chap.II, §VIII) retains its validity, not only for all forms 
X*? + NY?, but even for all forms wX? + v¥?, so that in 
particular, if ~ and v are positive, no integer can be a prime 
if it can be written as wa* + vb* in two different ways. In 
1778 Euler’s assistants wrote up several proofs or sketches 
of proofs for this, along with prescriptions for factoring an 
integer written as a” + vb* in two different ways (Eu.I- 
3.422-423, Eu.J-4.271, Eu.J-4.305-310, etc.); they are 
clumsily written and hardly convincing. Actually Euler’s 
proof of 1747, reproduced in Chap.II, §VIII, can easily be 
adapted to this general case. Take first n = wa? + vb’; 
assume that gq = px” + vy" is a prime divisor of n, not 
dividing wv. In view of an identity well known to Euler (cf. 
e.g. Art.179 of his Algebra: Eu.I-1.424), we have 


ng = (wax+vby)? + wv(ay¥ bx)’. 
At the same time, qg divides the integer 
ny — b’¢ = play—bx)(ay+ bx); 
as it is prime to pw, it must divide one of the integers 
ay + bx; as gq’ divides ng, this implies that q divides 
pax + vby. Writing 
pax + vby = qu, ay + bx = qv, 
we get 
n 
q 
In particular, for w > 0, v > 0, wv > 1, this shows that 
n = qimplies u = +1, v = 0; therefore no prime can be 
written in two different ways as wa®> + vb, as was to be 
proved. Of course the same conclusion would be reached 
more easily nowadays by using some ideal-theory in the 


field Q(V — wv), or rather in the ring Z[V — wr]. 


=w+ pur. 
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Undoubtedly the observations referred to above, together 
with Euler’s experience about large primes, must have 
prompted him to look for values of « and v for which the 
last-mentioned result has a converse; such values would 
then give rise to primality criteria, generalizing those he 
knew already. This amounts to asking for the pairs (y, v) 
such that every odd integer, prime to wy, is prime whenever 
it has one and only one representation as ua” + vb? and 
that representation is proper. This problem must have kept 
Euler and his assistants busy for some time, until an answer 
was found in 1778, and a list could be drawn up of 65 
numbers from | to 1848 such that the pair (wu, v) has the 
property in question if the product pv is one of the integers 
in that list; he called such numbers “numeri idonei” (“suitable 
numbers”, i.e. suitable for testing the primality of large 
integers). In May 1778 Euler sent a brief account of those 
discoveries (Eu.1-3.418—420 = E 498/1778) to his friend 
and former Berlin colleague, the Swiss mathematician Bé- 
guelin, who communicated it to Lagrange; both asked for 
more details. These were sent on 30 June 1778 by Euler’s 
young friend from Basel, Nicolas Fuss (Eu.I-3.420—-428 
= E 708a). Fuss had already sent a similar report to Daniel 
Bernoulli in Basel. Bernoulli’s less than enthusiastic answer, 
of 18 March 1778, was as follows: 

“... Ce que vous me dites tant de votre part que de celle de M. 
Euler est sans doute infiniment plus sublime; je veux parler du 
beau théoréme de M. Euler sur les nombres premiers et de sa nouvelle 
méthode pour examiner tel nombre qu’on propose, quelque grand 
quil pussse étre, s'il est premier, ou non. Ce que vous vous étes 
donné la peine de me dire sur cette matiére m’a paru fort subtil et 
digne de notre grand maitre. Mais ne trouvez-vous pas que c'est 
presque faire trop d'honneur aux nombres premiers que d’y répandre 
tant de richesses, et ne dott-on aucun égard au gott raffiné de 
notre stécle? Je ne laisse pas de rendre justice & tout ce qui sort de 
votre plume et d’admirer vos grandes ressources pour surmonter 
les difficultés les plus épineuses; mais cette admiration se redouble 
quand le sujet peut mener a des connaissances utiles. Je range dans 
cette classe les profondes recherches dont vous me parlez, sur la 
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” 


force des poutres...” [“what you tell me, on your behalf and 
that of Mr. Euler, is infinitely more exalted, no doubt; I 
mean Mr. Euler’s beautiful theorem on prime numbers and 
his new method for deciding whether any given number, 
however large, is a prime or not. What you have taken the 
trouble to tell me about this question seems to me very 
ingenious and worthy of our great master. But, pray, is it 
not doing almost too much honor to prime numbers to 
spread such riches over them, and does one owe no deference 
at all to the refined taste of our time? I hold in due esteem 
whatever comes from your pen and admire your high ability 
to overcome the thorniest difficulties; but my admiration 
is doubled when the topic leads to some useful piece of 
knowledge. That includes, in my view, your deep researches 
into the strength of beams . . .”: Corr.II.676-677]. 

This was nothing new to Euler; well he knew how little 
interest his contemporaries, with the sole exception of La- 
grange, had taken in his arithmetical work (cf. Chap.II, 
8X VII). Anyway, on 16 March 1778 (almost the day of Ber- 
noulli’s letter to Fuss), he had three papers on his “numeri 
idone.” and their use presented to the Petersburg Academy 
(Eu.1-4.269-289 = E 708; Eu.1-4.303-328 = E715; Eu.!- 
4.360—394 = E 719); one more was added on 20 April 
(Eu.1-4.395-398 = E725). They are so ill coordinated with 
one another, and some of the formulations and proofs in 
them are so confused and defective, that one is tempted to 
attribute them to a variety of assistants over whom Euler 
was not keeping close control. To try to reconstruct from 
these papers a full theory of the “numeri idonet” would be 
a hopeless task®; for our purposes the following brief account 
will suffice. 

Euler’s list consists of sixty-five integers, out of which 


* Such a theory can be found in the conscientiously written paper of 
F. Grube, Zeitschr. fiir Math. u.Phys.X1X (1874), pp.492-519, along with 
an exposition and a detailed criticism of Euler’s papers; it is based on 
Gauss’s theory of quadratic forms. Cf. also J. Steinig, Elem.d.Math.XXI1 
(1966), pp.73-88. 
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thirty-seven are squarefree, the rest consisting of products 
of some of those with 4, 9, 16, 25 or 36; as to the latter, 
specific rules are given by Euler (Eu. I-4.280-288, Theor. 1 
to 9; cf. F. Grube®, §10). As Gauss was to observe incidentally 
some twenty years later in the Disquasitiones (Art.303), Euler’s 
numbers are precisely those integers N for which every 
“properly primitive” positive form aX” + 2bXY + cY? with 
the “determinant” 6* — ac = —N (i.e. every such form 
where a, 2b, c have no common divisor) is “anceps”, this 
being tantamount to saying that it is equivalent either to a 
form pX? + vY? with wv = N, or toa form 


u(X+¥)? + 4(X-Y¥)? 


with py = N and pw = v (mod 2). For N squarefree, an 
equivalent definition would be to say that N, if squarefree, 
is “idoneus” if and only if every prime divisor p of X? + NY’, 
prime to 2N, is such that either p or 2p (or both) can be 
expressed as pa” + vb? with wv = N. One may well wonder 
whether Euler became aware of this last property, which 
would surely have appealed to him in view of his wide ex- 
perience with forms such as 4X* + vY*. In any case there 
can be no doubt that he must have accumulated a good deal 
of numerical evidence before singling out the “nwmeri idonet” 
for his purposes. 

We shall not attempt to examine the definitions proposed 
for the “numeri tdonet” in Euler’s papers, nor the criteria 
described there for deciding which numbers are such; the 
latter, involving the examination of the integers N + a? for 
a” < 3N, must have been based on some process of descent, 
akin perhaps to Lagrange’s method of reduction. The fact 
that the numbers on Euler’s list are precisely those which 
answer Gauss’s simple definition shows that, on this occasion 
also, his instinct was sound. 

What particularly pleased Euler about his numbers is that 
they allowed him to discover quite large new primes; for 
instance, using the largest one 1848, he could easily establish 
the primality of 18518809 = 197° + 18480000 (Eu.1-3.420, 
Eu.1-4.385—-389). He also tried to extend his list beyond 
1848, testing, as he tells us, all integers up to 10000 and 
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beyond. To his utter astonishment, he drew a blank; that 
such a naturally defined set of numbers should apparently 
be finite, as he now came to conjecture, was a novel experience 
to him (“Insigne istud paradoxon . . . circa nullam adhuc aliam 
seriem observatum esse memini”: Eu.1-4.396 in E 725; cf. Eu.l- 
3.420 in E 498). To us it comes as no surprise, since we 
know that the number of “classes”, for a “determinant” 
—WN, increases with N faster than the number of decom- 
positions N = pv of N into two factors. For a formal proof 
that the number of “numeri idonez” is finite, cf. W. E. Briggs 
and S. Chowla, Can. J. of Math. 6 (1954), pp. 463-470. It 
does not seem to be known whether there is any such number 
beyond Euler’s list. 


§XI. 


Euler’s interest in number theory had first been awakened 
by Fermat's statement about sums of four squares (cf. above, 
§IV); but he had sensibly postponed taking this matter up 
until he had made sufficient progress with sums of two 
squares. From 1747 onwards, we find more and more ref- 
erences to sums of three and of four squares in his corre- 
spondence (cf. §Vd); but no real progress could be made 
until he discovered (perhaps by sheer guesswork: cf. Eu.I- 
6.312 in E 407|1770) the famous identity for products of 
sums of four squares: if 


m=@+P4+ C40, n=ptPtre+s, 
then 
mn = A® + B2+C? + D? 
where one has put 


A =ap+ bq + cr+ds, B= aq — bp — os + dr, 
C = ar + bs — ch — dq, D = as — br + cq — dp; 


obviously, then, one has also: 


CMC RACRAC! 


(Corr.1.452]1748; Eu.1-2.368~-369 in E 242|1751). 


Euler 227 


In the following year, after reaching complete success 
with sums of two squares (cf. above, §8Vé and IX), Euler 
tried to apply the same method, combined with the above 
identity, to sums of four squares (Corr.1.495~497|1749, 
5211750, 527|1750, and Ew.I-2.338-372 = E 2421751); 
clearly this required the following two steps. 

Firstly he had to show that every prime divides a sum of 
four squares. As to this he proved more, showing that, if p 
is any prime, the congruence x” + y? + z2?=0 (mod p) has 
a non-trivial solution, i.e. one other than (0, 0, 0); later on, 
in his definitive paper on the same subject (Eu.I-3.218— 
239 = E 445|1772), he even extended this to an arbitrary 
congruence Ax? + py? + vz?=0 (mod p), where X, w, v are 
prime to p; in fact, as he observed in substance, the function 
x—> x” (mod p) takes (p+ 1)/2 distinct values, viz., 0 and the 
quadratic residues modulo p; the same must therefore be 
true of the functions x > Ax” and x > — yx" — v, so that 
those two sets must have at least one element in common. 

Secondly it had to be shown that any prime which divides 
asuma’ + b? + ¢? + d® (where a, b, c, d have no common 
divisor, while some of them may be 0) is itself such a sum. 
Here Euler was unable to adapt the proof he had used for 
sums of two squares; all he could do was to show that every 
prime, and consequently every integer, is at any rate asum 
of four rational squares. In fact, assume that this is not so, 
and let p be the smallest prime which cannot be expressed 
as such a sum. Let (a, b, c) be a non-trivial solution of 
x” + y° + z?=0 (mod fp); here one may assume that a, b, ¢ 
are 20 and <p/2. Put 


N=a@?+0? +0? =N', 


so that N < 3p*, N' < p. As all prime factors of N’ are <p, 
they are sums of four rational squares, and so is N’ in view 
of Euler’s identity; for the same reason, p = N/N’ is then 
a sum of four rational squares. Between that result and 
Fermat’s theorem, there was still, as Euler realized, a wide 
gap which he found himself unable to bridge (cf. Appendix 
II). 
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There the matter rested for more than twenty years, until, 
in 1772, he received from Lagrange the memoir where 
Fermat’s assertion was finally proved in full (Lag. III.189— 
201; cf. Eu.IV A-5.492); that proof was based on Euler’s 
work on sums of two squares. Euler must soon have taken 
up the matter again; in a brilliant paper, presented to the 
Petersburg Academy on 21 September 1772 and first pub- 
lished in the Nova Acta Eruditorum of 1773 (Eu.1-3.218-— 
239 = E 445), he begins by congratulating Lagrange on 
his achievement, then, rightly describing Lagrange’s proof 
as “far-fetched and laborious” (“nimis longe repetita et vehe- 
menter operosa”: loc.cit.p.218), he proceeds to give a new and 
elegant variant of the proof for sums of two squares 
(thid. pp.222—224), showing finally that it applies equally well, 
not only to the forms X* + 2Y°? (ibid.pp.225—226 resp. Eu.1- 
3.275 in E 449|1772) and X® + 3Y* (ibid.pp.226—-227), but 
also to X*® + ¥* + Z? + T* (ibid.,pp.230—231). In the latter 
case, the proof is as follows. 

Let p be an odd prime divisor of a?, where the a;, for 
2 = 1,2,3,4, are integers without a common divisor which 
may be assumed to be 20 and <p/2. Putting Za? = pm, we 
have m < p. If m = 2, two of the a;, say a, and a2, must be 
odd, the others being even; then we have 


_ {ata GQ, — Qe a3 + a4 , a3 4 2 
= (54) + (58) + at) + As); 


and p is a sum of four squares; thus we may assume m > 2. 
For each i, put a; = 6; + me; with |b| < m/2; then 5d? = 0 
(mod m), and we can write >b? = mn. As the a; have no 
common divisor, not all the ); can be 0, and not all can be 
+m/2, so that 0 < 3b? < m® and 0 < n < m. Now apply 
Euler’s identity to Da? and X;; this gives m?pn = TA?, with 
A, = Xajb;, and Ae, A3, Ay given by 


Ag = Ad5 _ agb, _ asb4 + a4ds = m(c)b5 _ Coby _ 6304 + C43) 


and two similar formulas for A; and A4. As m°pn and Aj, 
A3, Aj are multiples of m”, so is Aj; therefore we can put 
A; = mB; for alli, and we get pn = B?. Call d the g.c.d. of 
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the B,;, and put B; = da; for each i; then d* divides pn. As 
0<n<m< p,d must be prime to p, and d’ must divide 
n; put n = d°m'. Thus we have pm' = Xa/j?; this is a sum 
of four squares with no common divisor which is a multiple 
of p smaller than pm. Proceeding in this manner, one gets 
p itself expressed as a sum of four squares, as was to be 
proved. 

More than a century later, Hurwitz was to write out a 
proof for the same theorem, based on the non-commutative 
ring of integral quaternions and on the fact that this ring 
has a euclidean algorithm (cf. his Math. Werke, 11.303~330); 
whether he was aware of it or not, this was in substance 
nothing else than a transcription of Euler’s proof in the 
language of quaternions. 


§XIL. 


Euler’s interest in “Pell’s equation” x° — Ny? = 1 had 
been aroused in 1730, apparently by his correspondence 
with Goldbach (cf. above, §1V). Already at that time he had 
observed that the successive solutions make up “recur- 
rent” sequences “conflated out of two geometric progres- 
sions” (Corr.1.37|1730; cf. infra, §XIII); in the case 
N = 8, for instance, this means that the successive integers 
yo = 0,9, = 1, yo = 6, etc., whose squares are “triangular 
numbers” (i.e. of the form $n(n + 1), or (x? — 1) if one puts 
x = 2n+ 1) are given by 


_ (8+2V2)" - 3-22)" 
a 


(Corr.1.36) and satisfy the recurrence relation 


net = On — Yn 
(Corr.1.30). Soon he was publishing a short paper on the 
subject (Eu.I-2.6—-17 = E 29|1733) devoted chiefly to the 
more general equation y* = ax? + bx + c (cf. infra, XID, 
but including a description of Brouncker’s method of so- 
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lution for “Pell’s equation” (cf. Chap.II, §XIII), which he 
attributes there, as he always did, to Pell and also, for no 
apparent reason, to Fermat; he illustrates it by treating the 
example x* — 31y* = 1 (loc.cit.p.13) and adds the important 
observation that the solutions of p> — Nq* = | provide good 
rational approximations p/q for VN (ibid. p.15); this had al- 
ready been known to Archimedes (cf. Chap.I, §VIIJ). 

As to continued fractions, they first occur in Euler’s cor- 
respondence in an altogether different context (Corr.1.58- 
59|1731), in connection with Riccati’s differential equation; 
soon, however, he becomes interested in them for their own 
sake (cf. Eu.I-14.187-215 = E 71/1737), observing inci- 
dentally that rational numbers have finite continued fractions 
(obtained by a process identical with the euclidean algorithm), 
that periodic continued fractions represent quadratic ir- 
rationalities, and noting also that the expansion of any real 
number into a continued fraction supplies the best rational 
approximations for that number. Some of this had been 
known to Huygens (cf. above, Chap.II, §X VII), but Euler 
was Clearly unaware of Huygens’s work on that subject. 

Since Brouncker’s solution of Pell’s equation x° ~ Ny? = 
1, and the continued fraction for VN, both supply good 
rational approximations for VN, it had to occur to Euler to 
compare them with one another. When he did so (Eu.I- 
3.73-111 = E 323]1759), he discovered that the two al- 
gorithms are in fact identical. 

To verify that fact, consider Brouncker’s method (the 
“methodus Pelliana”, as Euler always called it) as it has been 
described in Chap.I], §XIII. It consisted in constructing a 
sequence Fo, F, ... of quadratic forms 


F{X;, Xj+1) = A,X? ~ 2B; X;Xj+1 ~ CiX7 31, 
beginning with 
F (Xo, X1) = NXG — (nXo+X\)" 


and defined as follows. Denoting by [€], for every real number 
€, the integer m such that m < € < m + I, one takes n = 
[VN]; then, calling €,, for each 7 = 0, the positive root of 
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F((€, 1), and putting m; = [&;,], one takes for F;,, the transform 
of —F; by the substitution X; = m;X;4, + Xie. This gives, 
for alli = 0: 


B,; + VN B, + 
BAG x, = BAW 9, 3 [| ” 


A, i Ci41 = Aj 


At the same time, in view of the definition of the F;, we 
have 


VN =n 55 f= m+ Gi = 0,1, 2,...) 


so that VN is given by the “continued fraction” 
VN = n + I/(rm + 1/(m, + 1/(mg + =))). 


Thus Brouncker’s method includes the construction of the 
continued fraction for VN. 

In Euler’s presentation, on the other hand, one has re- 
currence formulas (oc.cit.p.82) which, in our notation, can 
be written thus: 


+ 
m, = Ee "), Bi.) = Aym; — B;, 


1 


N-B? 4; 


A: = 
i+1 A; 


= Aj, + m({B;—B;,,), 
with the initial values By = n, Ay = N — n®; they are at once 
seen to be equivalent to those given above. 

Once Brouncker’s method has been re-interpreted in this 
way, the facts proved in Chap.IJ, §XIII, have some further 
consequences. Firstly they imply that the continued fraction 
for VN is periodic; more precisely, there is p > 0 such that 
m,.; = m; for all7 2 Q; this occurs when p is taken so that 
F, = Fo; then A,_, = C, = 1. Write now F; = (A;, B;, Ci), 
as in Chap.II, §XIII, and put again G; = (C,, B;, A;); it has 
been shown there that G; can be derived from G;,, just as 
F;,, was derived from F;, with the same value for m,; this 
amounts to saying that if m; is as before, and if ;,; is the 
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positive root of G,;,,, we have 


Bi sy +n 
m= [Neil = "C.. . 
i+] 


As we have F, = Fy = (N—n’, n, 1), we have 
G, = (1,n,N—n’), N= nt VN, My, = 2n, 


so that F,_|(Xp_1, Xp) is the transform of — F,(X,, X,+1) by 
the substitution X,,, = X,-; — 2nX,; this gives F,_; = G,. 
This shows that G,_, is the same as Fo, therefore G, 2 the 
same as F,, and, by induction on 7, G,_; the same as F;_,; 
from this we get ny-; = € 1, Mmp_j-1 = m_1, for 1<i<p. 
This is the so-called “palindromic” property of the sequence 
(m,), by which is meant that we have: 


(mo, My, Mg, Mp2) = (mp -2, Mp—3, My —4) sees Mo). 


While Euler drew attention to these properties of the 
continued fractions for square roots VN, as well as to their 
use in solving Pell’s equation, there is no sign that he ever 
sought to back up his findings by anything more than ex- 
perimental evidence. He did mention that the values obtained 
by his process for the integers B;, A,;, m; are necessarily 
bounded (with B; = n, A; S B; + n,m, = 2n or m; <n for 
all i: loc.cit. p.83); from this he could at least have derived 
the conclusion that the sequence (m;) is periodic from a 
certain point onwards, but he failed to mention this, or did 
not bother to do so. 

Much later he came back to this topic (Eu.I-3.310- 
334 = E 454|1772), extending his observations to the con- 
tinued fractions for other quadratic irrationalities and in 
particular for square roots Vi where r is any rational number 
>1; as to the Jatter, one can in fact prove the periodicity 
and the palindromic character of the continued fraction 
just as above, by taking as starting point for the sequence 
(F,;) the quadratic form 


F (Xo, X1) = 6X5 — a(nXo+X,)° 
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where one has putr = 6/aand n = [Vr]. In that same paper 
Euler also applied continued fractions to the determination 
of the minimum value of |F(a, 5)| for integral values of a 
and 8, not both 0, when F is any indefinite binary quadratic 
form. 

Just for once he was coming too late. In 1768, taking up 
number theory for the first tume, Lagrange had proved, 
still somewhat clumsily, all the basic theorems on Pell’s 
equation (Lag.1.671—731); not much later he gave what may 
be regarded as a definitive treatment of the subject, based 
on the continued fraction algorithm (Lag. 1I.494—496); it is 
essentially the one we have described in Chap.II, §XIH; it 
is contained in a long memoir (Lag.11.377~535|1768) pub- 
lished in Berlin in 1769, which reached Euler in 1769 or 
early in 1770 (cf. Eu.IV A-5.466), along with a number of 
other contributions to which Euler must have paid more 
attention. Lagrange’s solution, along with some further re- 
sults on the same subject, was inserted by him in his Additions 
(composed in 1771, but published only in 1773; cf. infra, 
Chap.IV, §II) to Euler’s Algebra; there he also gave a detailed 
treatment of the same minimum problem for quadratic forms 
which Euler was to discuss in his paper E 454. This, too, 
received scant attention from Euler (cf. Chap.IV, 8III, and 
above, §1X). When in his later years Euler came back to the 
topic of Pell’s equation (Ew.I-4.76-90 = E 559/1773), he 
added nothing of substance to what by that time was already 
public knowledge on that subject. 


§XIII. 


Already in 1730, and in all his subsequent work on “Pell’s 
equation” (cf. above, §XI]), Euler used to discuss that topic 
within the broader framework of the diophantine equations 
of degree 2 in two unknowns, to be solved either in integers 
or, as Diophantus had done, in rational numbers. For many 
years he confined his attention to the type y? = ax? + 
Bx + y which had been studied by Diophantus (cf. Chap.I, 
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§X). In 1759 (or perhaps in1763, since it seems that the 
manuscript was revised once) he indicates briefly that he 
has also extended his investigations to the more general 


type 
F(x, y) = Ax® + 2Bxy + Cy? + 2Dx + 2Ey + F = 0 


(Eu.1-3.76 in E 323|1759); but then the topic is dropped 
until 1772, when that equation is taken up for its own sake 
in a paper (Eu.I-3.297-309 = E 452|1772) perhaps intended 
as a supplement to Lagrange’s work on the same subject 
(cf. Chap.IV, $11). 

Except perhaps on one occasion, also in 1772 (cf. above, 
§1X), Euler always made clear that he could do nothing 
with a problem of that type unless at least one solution is 
somehow known beforehand (mostly “by guesswork”, “quasi 
divinando”: Eu.1-2.610). In the geometric interpreta- 
tion which had been known to Newton (cf. above, Chap. II, 
§XV), the method for solving rationally an equation 
F(x, y) = 0 of degree 2, once a solution (a, 5) is given, is to 
intersect the conic F = 0 with an arbitrary straight line 
going through (a, b); as we have seen (Chap.1, §X), this goes 
back to Diophantus. Euler does not mention that interpre- 
tation, but it may well have been at the back of his mind 
when he described that method in algebraic terms, at first 
in 1758 for the case y? = ax? + Bx + y, then in 1772 in 
the general case. Taking (a, 6) to be a solution, and inter- 
secting the curve F = 0 with the line 

x-a_ y-b 


p q 


(Eu.1-2.578 in E 279|1758; Eu.1-3.299 in E 452/1772), he 
obtains a point (a’, 6’) given by formulas 


a’ = hat pht+a, 
b' = vat+ pb +7, 


il 


where the coefficients are rational functions of p and q. In 
modern language, it is easily verified that the linear 
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substitution 


defined by 
(x, y) > (Axt py to, vxt pyt7) 


transforms F into itself; consequently its “homogeneous part” 


i.e. the substitution 
(X, Y) > (AX + pY, vX + pY) 
transforms into itself the homogeneous part 
F(X, Y) = ¥* — aX?* resp. Fy(X, Y) = AX® + 2BXY + CY? 


of F. In other words, Euler’s formulas determine auto- 
morphisms of the form Fy. 

As Euler knew, nothing need be added to the above for- 
mulas if the question is to obtain all rational solutions of 
F = 0; but finding integral solutions is another matter. As 
to this, his method (already in 1733: Eu.1-2.7 in E 29|1733; 
cf. Corr.1.36|1730) can be described as trying to construct 
a linear substitution { with integral coefficients, transforming 
F into itself; if this can be done, then &, applied repeatedly 
to one solution in integers, may be expected to produce, if 
not all, at least infinitely many such solutions, provided & 
is not of finite order. Incidentally, the substitution S defined 
above cannot be used for that purpose, since it is of order 
2; the failure to perceive this circumstance (a failure one 
may ascribe to Euler’s assistants rather than to himself, since 
he had not committed it in 1758; cf. Eu.I-2.582, Coroll.2, 
in E 279|1758) vitiates the first part of his paper of 1772 
(Eu.1-3.301—303). 

In modern language, if 2 is to be such as Euler requires, 
its homogeneous part X» must belong to the group ly of 
the automorphisms of F, i.e. of the substitutions with integral 
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coefficients and determinant Ap — pv = | which transform 
Fy into itself; given such a Xo, there is a unique & (not nec- 
essarily with integral coefficients) which transforms F into 
itself. Following Euler, we exclude the essentially trivial case 
where the “determinant” A = B? — AC of Fy is 0, i.e. where 
the conic F = 0 is a parabola. Then % is deduced from Xp 
by adjusting o, 7 so that % leaves invariant the center of the 
conic F = 0. 

Take first the case of the equation y* = ax* + Bx + y; 
with Euler we assume, not only that a@ is not 0, but also that 
it is >0 and not a square, since otherwise the problem has 
only finitely many solutions which can be found by trial and 
error. Then Euler’s calculations (Eu.1-2.8—9 in E 29|1733) 
show that the group I) of the automorphisms of Fy = 
Y? — aX? consists of the substitutions 


where (p, g) is any solution of “Pell’s equation” p — aq’ = 1; 
as he was to observe (Eu.I-2.600—601 in E 2791758), this 
fact is closely related to “Brahmagupta’s identity”, which 
he may well have first noticed precisely in that context (cf. 
above, §VIII). 

With &» as shown above, the corresponding substitution 


> is given by 
x= (2 q (o- DBP), 
ag p= Bg/2 


iterated, this gives: 


32 = (": ag? pq Br 
2apq = ptag’ Bpq 
which has integral coefficients. Thus, using either the group 
generated by 2, or at any rate the group generated by 2°, 
one can always, from one solution in integers, derive infinitely 
many others. Lagrange was later to prove, in substance, 
that all the solutions can be derived thus from only finitely 
many (Lag.11.512—522, 704-705); Euler never raised this 
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question. What aroused his interest was a different matter: 
what is the nature of the sequence (a,, b,) generated by 
applying a substitution such as &, and its iterates &”, to a 
given solution (do, b9)? The answer, as he had discovered 
quite early (Eu.1-2.10 in E 29|1733; cf. Corr.1.30,36|1730) 
was that (a,), (b,) are “recurrent sequences”. 

This concept had been studied in some detail by Daniel 
Bernoulli in a paper written in 1728 or 1729 (and published 
in 1732: D.Bern.11.49-64), following work on the same subject 
by A. de MOIVRE and others including Daniel’s cousin 
Nicolas Bernoulli and Goldbach (cf. ibid.p.49). Euler must 
of course have been familiar with Daniel’s paper from the 
time of its composition; he was to give a masterly exposition 
of the whole topic in Chapters IV, XIII and XVII of his 
Introductio of 1748 (Eu.I-8). An infinite sequence (Ao, Aj, 
Ag, ...) is called “recurrent” if it satisfies a linear relation 


Anam = L(Ansm—1 An+m~2 tess A,) 


for all n = 0; this is equivalent to saying that its “generating 
series” ¥6A,x" is the expansion of a rational function P(x)/Q(x) 
at x = 0, with Q(x) = 1 — L(x, x’, ..., x”), P(x) being a 
polynomial of degree < m; that statement has to be slightly 
modified if L, instead of being homogeneous, has a constant 
term. In particular, if Q has no multiple root and has the 


simple roots €, &, ..., En, one can write 
P(x) _ 5 hy 
Qe) T1- &x 


and consequently 
A, = > La 
1 


so that, in Euler’s words, the sequence (A,) can be said to 
be “conflated out of m geometric progressions” (cf. above, 
§XII). 

Now, coming back to the equation y* = ax? + Bx + y, 
take first, as Euler does (Eu.I-2.589—590 in E 279|1758) the 
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case B = 0. Then the solutions (a,, b,,) derived from a given 
one (do, 9) are obtained by applying to it the iterates 25 of 
the substitution 2» defined above. This gives, for n > 0: 
Qn41 > pan + gon bat = agan + pois 
Qn+2 = pansitqbnais bisa Oqay +1 + poner, 


I! 


and therefore, since p> — aq? = 1: 
an+2 = Q2pan+1 ~ any bse = 2pbr+i ~~ by. 


From the theory of recurrent sequences (“ex doctrina serierum 
recurrentium”: ibid.p.584) it follows now that they can be 
written in the form 


a, = (r+sVa)(ptqVa)” + (r—sVa)(p—qvVe)" 


with suitable rational ry and s; there is a similar formula for 
b,. For 8B # 0 one gets a corresponding result (¢bid. p.584— 
585). 

As to the general case of an equation F(x, y) = 0 of degree 
2, Euler assumes that the “determinant” A = B? — AC of 
Fo is >0 and not a square; otherwise, as he probably knew, 
the problem has only finitely many solutions. Then, with 
the same notations as above, it turns out that the form 
Fy admits the automorphism 


where (p, g) is a solution of p* — Aq? = 1; here again this 
can be extended to an automorphism 2 of F itself (Eu.1- 
3.76 in E 323|1759; Eu.I-3.306 in E 452|1772) where the 
coefficients o, 7 of & appear as fractions with the denominator 
A, and need not be integers. Did Euler know that in this 
case too the coefficients of * are integers? This is not men- 
tioned. On the other hand, he proves at length that the 
successive transforms (a,, 5,) of a given solution (ao, bo) by 
the iterates of } are expressed by formulas of the type 


Qn = (r+sVA)(p+qVd)” + (r—sVA)(p—qvVA)" ~ 2r + ao. 
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This time, thanks to a newly acquired technique (cf. infra, 
§XI1V), the proof makes no appeal to “the theory of recurrent 
series” (Eu.1-3.306-309 in E 452|1772). 


SXIV. 


Already Fermat had applied the theory of quadratic forms 
to diophantine problems (cf. Chap.II, XII). This same idea 
was bound to occur to Euler when he took up the study of 
Fermat’s observations on Diophantus, around 1748 (cf. 
above, §1V). One first instance of this is given by a paper 
(Eu.1-2.223-240 = E 167/1748; cf. above, §1V) where he 
makes use of results on the form X® — 2Y? (which he was 
far from being able to prove at the time) in order to deal 
with a diophantine problem proposed by Fermat. At that 
same time his attention was attracted to Fermat’s statement 
on x” + y" = z" (Corr.I.445!1748); in 1753 he could announce 
to Goldbach that he had solved the case n = 3 (Corr.1.618). 
As he indicated some years later (Eu.1-2.557 in E 272|1759), 
this was based on the theory of the form X? + 3Y*, which 
he had not yet completed but with which he had already 
made decisive progress (cf. Eu.I-2.481—482, Scholion, in 
E 256|1753, and above, §1X). Undoubtedly his proof for 
the case n = 3 of Fermat’s equation must have been in 
substance the one which has been described in Chap.II, 
8XVI. Again in 1754 he applied the theory of the same 
form to the equation x° + y? + 2 = # (Eu.I-2.428-458 
=E 255|1754; cf. also Eu.I-2.557); his main result there, 
translated into modern language, is that the surface defined 
by that equation in projective space is rational; from our 
point of view it belongs to algepraic. geometry. 

His proof of 1753 for x? + y’ = z° was never published; 
but, by the me he decided to insert it into the last section 
of his Algebra (cf. above, 8Vc), he had worked out a new 
technique for dealing with quadratic irrationalities and using 
them in the theory of binary quadratic forms. Similar ideas, 
and even more far-reaching ones, had just occurred to La- 
grange in connection with his algebraic work (cf. Lag. 11.522- 
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535). This is how Euler wrote to Lagrange on discovering 
this coincidence: 

“Pai aussi fort admiré votre méthode d’employer les nombres 
irrationels et méme les imaginatres dans cette espéce d’analyse, qui 
est uniquement attachée aux nombres rationels. Il y a deja quelques 
années que j'ai eu de semblables idées; ... ayant publié ict une 
Algebre complette en langue russe, j’y at developpé cette matiére 
fort au long, ov j'ai fait voir que, pour résoudre Véquation 


ax + nyy = (pp +nqq)’, 
on n'a qua résoudre celle-ci: 
x + Van = (ptqVv—n), 
Cet ouvrage s’imprime actuellement aussi en allemand . .. Maas 
je n’y at pas poussé mes recherches au dela des racines quarrées; 


et Vapplication aux racines cubiques et ultérieures vous a été réservée 
uniquement. C'est de la que j'ai tiré cette formule trés remarquable 


x + ny? + nnz> — 3nxyz 


dont les trois facteurs sont x + yn + Wn; d’oit Lon voit qu’on 
peut toujours aisément déterminer les lettres x, y et z pour que cette 
formule devienne un quarré, ou un cube. . . ou quelque plus haute 
puissance” {“I have greatly admired your method of using 
irrational and even imaginary numbers in this kind of analysis 
which deals with nothing else than rational numbers. Already 
for several years I have had similar ideas; . . . having pub- 
lished here a complete Algebra in Russian, I have developed 
this matter quite at length, showing that, in order to solve 


x? + ny? = (p?+nq’)’, 
it is enough to solve 
xt Van = (ptqV—n). 


A German translation is being printed . . . But I did not go 
beyond square roots; and the application to cubic and higher 
roots is yours only. That is where I found the remarkable 
formula 


x + ny® + n?2? ~ 3nxyz 
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whose factors are x + yWn + 2Vn2; from which one sees 
that x, y, z can always easily be determined so as to make 
that formula equal to a square, a cube ... or any higher 
power”: Eu.IV A-5.467|1770; cf. Art.191 of his Algebra, Eu.I- 
1.431]. Similarly, commenting upon the same method in 
1772, he wrote: “Haec methodus . . . eo magis est notatu digna, 
quod ex doctrina irrationalium est petita, cutus alioquin nullus 
videtur esse usus in Analysi Diophantea. Eximium autem huius 
doctrinae usum tam pridem in Algebra mea Ruthenice et Germanice 
edita fusius ostendi” (“This method is all the more remarkable 
since it comes out of the theory of irrationals which otherwise 
seems to be of no use in diophantine analysis. It has such 
use, however, and an eminent one, as I have shown more 
at length some time ago in the Algebra I have published in 
Russian and in German”: Eu.1-3.309 in E 452\1772]. 
So far as Euler was concerned, the new method consisted 
in splitting a quadratic form x + ny® into its factors 
—n, or a form ax? + cy into the factors xVa + 
Wer As long as this is done only in order to derive algebraic 
identities, the use of this method is of course unimpeachable. 
Such is the case when it is applied to the proof of “Brah- 
magupta’s identity” (Art.176 of the Algebra, Eu.I-1.422; cf. 
Chap.1, §VIID or the more general identity 


(ap® + ¢q*)(r? +. acs”) = a(pr+cqs)? + c(qr#aps)? 
(thid., Art.178, p.424). In the same way, by putting 
xVa + yV—e = (pVatqv—c)° 
(bid., Art.188, p.429), Euler concludes rightly that, for 


x = ap’ — 3cpq?, —-y = 3ap’g — cq", 

the number ax? + cq becomes a cube, viz., (ap* + ¢q 

What he is after with such identities, however, and what 
he needs for his treatment of the equation x° + y*® = 2° (cf. 
ibid., Art.247, pp.492, 493), is the converse, a converse which 
he has no hesitation in spelling out. According to him, if 
the form aX? + c¥? does not split into rational linear factors 
(i.e. if —ac is not a square), and if ax? + cy? is a cube while 


2)3 
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x is prime to y, then there must be two integers p, q such 
that x and y are given by the above formulas “because then 
xVa + yV—c and xVa ~ yV~c have no common factor 
and so must be cubes” (zbed., Art.191, p.431). For instance, 
if x* + 3y? is a cube while x is prime to y, then there are 
integers p, g such that 


x= p — 9pq?, —-y = 3p'q — 3¢° 
(tbid., Art.189, p.430). 

This last assertion is true; it is no other than the content 
of our lemma 5 in Chap.II, §XIJ; it must have been known 
to Fermat (cf. Chap.I1, 88X11 and XVI) and is at any rate 
implicit in Euler’s theory of the form X* + 3Y? as contained 
in his memoir E 272|1759 (Eu. 1-2.556—575) which is expressly 
designed to supply the foundation for Euler’s earlier treat- 
ment of the equation x° + y° = 2°; in fact, the result in 
question is all that is needed for that purpose. In his seek- 
ing to justify it by his new “method”, his instinct had not 
wholly misled him, but this was due to the fact that the field 
Q(V— 3) admits a euclidean algorithm (cf. Chap.II, Appen- 
dix I), so that the laws of ordinary arithmetic retain their 
validity for the integers in that field. In this matter, too, it 
took Gauss to set things right. 

Nevertheless a momentous event had taken place. Al- 
gebraic numbers had entered number theory — through 
the back-door. 


§XV. 


What we now call “elliptic curves” (i.e. algebraic curves 
of genus 1) were considered by Euler under two quite dif- 
ferent aspects, without his ever showing an awareness of 
the connection between them, or rather of their substantial 
identity. 

On the one hand, he must surely have been familiar, 
from the very beginning of his career, with the traditional 
methods for handling diophantine equations of genus | 
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(cf. Chap.I, §X, Chap.II, §XV, and above, §Vd). Also in 
connection with such equations he eventually came to re- 
construct some of Fermat’s proofs by descent (cf. Chap.II, 
§XVI, and above, §§Vd and XIV). Finally, late in life, he 
took up such problems more systematically, including the 
“double equations” which had been a favorite both for Dio- 
phantus and for Fermat, and the equations ®(x,y) = 0 of 
degree 2 inx and 2 iny which were perhaps his most original 
contribution to the whole subject. 

On the other hand he had inherited from his predecessors, 
and notably from Johann Bernoulli, a keen interest in what 
we know as “elliptic integrals” because the rectification of 
the ellipse depends upon integrals of that type; they were 
perceived to come next to the integrals of rational functions 
in order of difficulty. That the latter can always be integrated 
by means of logarithmic and inverse trigonometric functions, 
and of course of rational functions, came gradually to be 
recognized in the seventeenth and eighteenth centuries until 
it was accepted as an established fact (cf. Corr.I.50—57 and 
170-171); eventually Euler derived it from the so-called 
“fundamental theorem of algebra” (Eu.1-17.70-194 = E 
162,163]1748; cf. above, §VIJ). Functions depending ra- 
tionally upon x and a square root Vax + bx + ¢ could 
then be integrated by the “diophantine method”, i.e. by the 
same change of variable 


—— _ 2 
t= Vax? + bx + ¢ — xVa, x=——— 
b — 2tV/a 
which is used for the solution of the diophantine equation 
y? = ax’ + bx + cin rational numbers when a is a square 


(cf. Corr.II.190 and J.Bern.1I1.393). Consequently, when 
Euler, in his younger days, tried (vainly, of course) 
to “integrate” the differential dx/V1 — x? (Corr.1.47|1730), 
he must have been aware that this was almost the same 
problem as finding rational solutions for y? = 1 — x‘, or 
(what amounts to the same) integral solutions for Z? = 
X* — ¥*; thus, when he reconstructed Fermat’s proof for the 
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impossibility of the latter equation (Eu.I-2.38-58 = E 
9811 738), this must have helped to persuade him that the 
differential in question could not be integrated in terms of 
known functions. In the same year, in fact, while sending 
to his old teacher Johann Bernoulli his latest discovery about 
integrals of that type, he expressed himself as follows: 
“Observavi nuper insignem elasticae rectangulae proprietatem, 


in qua st abscissa ponatur x, est applicata = Vai—x) 
a’—x 


) et longitudo 


aa . . 
curvae = | Vax quae expressiones ita sunt comparatae, ut 
a—x 


inter se comparart nequeant. At st abscissa sumatur = a, inveni 
rectangulum sub applicata et arcu comprehensum aequale esse areae 
circult curus diameter sit abscissa = a; quae observatio miht quidem 
notatu maxime digna videtur” [“I have newly discovered a re- 
markable property of the elastic curve given by the equation 
y = f(x), with 


* dx 


FO Jo ar 


its arc-length from 0 to (x,y) is 


=| a’dx 
OS I Vai 


these two indefinite integrals have no relation to one another, 
but I have found that f(a)s(a) = 47a”, which seems to me 
most noteworthy”: Bibl.Math.(11)5 (1904), p.291|1738; cf. 
Eu.J-14.268 in E 122|1739, and Euler’s letter to Clairaut, 
Eu.lV A-5.114|1742]. That discovery was nothing else than 
Legendre’s relation between the periods of elliptic integrals 
of the first and of the second kind, specialized to the lem- 
niscatic case. 

As to Euler’s later discoveries about elliptic integrals, that 
story is well known (cf. e.g. A. Enneper, Elliptische Functionen, 
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Theorie und Geschichte, Halle 1876, and Eu.I-20, pp.VII- 
VIII). Here we shall deal with the aspects most relevant to 
our main theme. For proper background, cf. infra, Appendix 
Ill. 

Euler had closely followed contemporary work on those 
integrals, including more particularly that of MACLAURIN 
and of @ALEMBERT (cf. e.g. his letter to d’Alembert, Eu.IV 
A-5.252|1746). On 23 December 1751 the two volumes of 
Fagnano’s Produzioni Matematiche, just published, reached 
the Berlin Academy and were handed over to Euler; the 
second volume contained reprints of pieces on elliptic in- 
tegrals (Fag. 11.287—318) which had appeared between 1714 
and 1720 in an obscure Italian journal and had remained 
totally unknown. On reading those few pages Euler caught 
fire instantly; on 27 January 1752 he was presenting to the 
Academy a memoir (Eu.1-20.80—-107 = E 252) with an ex- 
position of Fagnano’s main results, to which he was already 
adding some of his own. 

The most striking of Fagnano’s results concerned trans- 
formations of the “lemniscatic differential” 

dz 
Vi = xt 
how he had reached them was more than even Euler could 
guess. “Surely his discoveries would shed much light on the 
theory of transcendental functions”, Euler wrote in 1753, 
“if only his procedure supplied a sure method for pursuing 
these investigations further; but it rests upon substitutions 
of a tentative character, almost haphazardly applied .. .” 
(“Hine certe theoriae quantitatum transcendentium insigne lumen 
accenderetur, st modo via, qua FAGNANUS est usus, certam 
methodum suppeduaret in huiusmodi investigationibus ultertus 
progrediendi; sed. . . tota substitutionibus precario factis et quasi 
casu fortuito adhibitis nititur . . .”: Eu.1-20.110 in E 263]1753). 

But, as he had written the year before: “To this end [the 
progress of Analysis] such observations are particularly val- 
uable as have been made almost by chance and discovered 


w(z) = 
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a posteriori, while no a4 priori explanation, no direct way of 
reaching them, has been perceived ... Of that kind are a 
few observations . . . which I have found in Count Fagnano’s 
newly published volume...” (“Ad hunc scopum imprimis ac- 
comodatae videntur eiusmodi observationes, quae cum quasi casu 
sint factae et a posteriori detectae, ratio ad easdem a priori ac per 
utam directam perveniendi. . . neutiquam est perspecta. . . Huius- 
modi autem observationes ... nonnullas deprehendi in opere Iil. 
Comitis FAGNANI nuper in luce edito”: Eu.1-20.81 in 
E 252|1752]. 

With characteristic generosity Euler never ceased to ac- 
knowledge his indebtedness to Fagnano (cf. e.g. his letter 
to Heinsius, PkU.109|1764); but surely none but Euler could 
have seen in Fagnano’s isolated results the germ of a new 
branch of analysis. His first contribution was to extend 
Fagnano’s duplication formula for the lemniscate to a general 
multiplication formula (Eu.1-20.100 in 252]1752). If, as he 
says in substance, z > u = f(z) transforms the differential 
@(z) into w(u) = nw(z), then the substitution 


L- /(1 ~uu)(1 — zz) 

we (1 + uu)(1 + zz) 
transforms @(z) into w(v) = (n+ 1)@(z). Soon he was to rewrite 
this in the equivalent form 


wVi- ui +uvyil ~ 2 
1 + uuzz 


(Eu.1-20.65—66 in E 251|1753) and to recognize that this 
implies 


(A) ZrVve= 


(A’) v= 


atv) = a(z) + a(n) 


when z, u are treated as independent variables, so that it is 
nothing less than the addition formula for the lemniscatic 
integral fw(z). 
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In this, his second paper on that subject, we already find 
a clear indication of the main ideas which he was to develop 
later about elliptic integrals. Of course he also explored 
some blind alleys; as such we may describe his persistent 
efforts to give geometric interpretations of his results in 
terms of the arc-lengths of conic sections and other curves 
(cf. e.g. Corr.1.568—569|1752, E 264|1755, E 639|1776); such 
are also the elaborate calculations in which he sought “a 
shorter and more natural way” (“ein kiirzerer und natiirlicherer 
Weg”: PkU.109|1764; cf. Eu.I-20.305-311 in E 345|1765) 
for proving what in substance was the addition theorem. 
On the latter attempt he has this comment, however: “I 
cannot deny that this is done so circuitously that one could 
hardly expect those operations to come spontaneously to 
anyone’s mind” [“Déffitert non possum, hoc per multas ambages 
esse praestitum, ita ut vix sit expectandum cuiquam has operationes 
in mentem venire potusse”: Eu.1-20.311]}. 

What seems to have struck Euler in the first place is that 
Fagnano’s work supplied him with one algebraic integral 
of the differential equation w(x) = w(y) (besides the trivial 
one x = y), this being given by x*y? + x7 + y? -— 1 = 0. 
Soon he must have noticed that his formula (A), or equiv- 
alently (A’), contained implicitly the general integral of the 
same equation; after replacing v, z and u, in (A’), by y, x 
and a constant c, the relation between x and y can be written 
as 


(B) cx y? +x + y = t+ OxyV 1 —<¢ 


(Corr.1.567|1752; Eu.1-20.63 in E 251|1753), which reduces 
to Fagnano’s for ¢ = 1. 

Perhaps Fagnano had somehow been guided by the anal- 
ogy with the equation 


dx dy 
a V1 = 


At any rate Euler used it quite consciously and systematically, 
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observing that this last equation had the general integral 
(the “complete” integral, as he preferred to call it): 


r+ ype + QxyiV 1 -¢ 


(Corr.1.567|1752); this is obtained of course by putting both 
sides of the equation equal to dt, then taking x = sin t,y = 
sin (t+ t9), ¢ = sin fy. As he observes at once, a similar treat- 
ment for w(x), w(y) “is not possible or at least not avail- 
able” [“huiusmodi comparatio in formulis transcendentibus 
JS dx/V1 — x? et f dy/V1 — y* locum non habet seu saltem non 
constat”: Eu.I-20.61]. Had he pursued this idea further, he 
would, as we know now, have discovered Jacobi’s elliptic 
functions, or at least, as Gauss did, the “lemniscatic” sine 
and cosine. Instead of this, after obtaining the desired in- 
tegral “not by any sure method but by guess-work” [“nulla 
certa methodo ... sed... . tentando vel divinando”: ibid. p.61], 
he merely verified (B) a posteriori by differentiating it and 
at the same time using it to express x in terms of y and y in 
terms of x. 

From then on, Euler’s handling of elliptic integrals always 
rested upon the consideration of relations ®(x,y) = 0 of 
degree <2 in x and <2 in y. Such a relation can be written 


P(x,y) = Po(x)y” + 2Pi(x)y + Po(x) = 0 


or also as 


D(x,y) = Qo(y)x* + 2Q.(y)x + Qo(x) = 0 


where the P; and Q; are polynomials of degree <2. By solving 
this either for y or for x, one sees that the curve defined by 
® = 0 is birationally equivalent to either one of the curves 
2 = F(x) and w® = G(y), with 


F = Pi — PoP», G = Qi -— Q,Qs. 


Thusa relation ® = 0 can serve to establish an isomorphism 
between two curves 2” = F(x), w” = G(y); this idea has been 
occasionally used by Euler (cf. Eu.1-20.69-71,77, 248-249). 
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Moreover, we have here: 


® \/C 
oe = Qoly)x + Qily) = + VGQ), 
aD 

G = Polly + Pils) = * VFG), 


i 
2 
i 
2 


so that, differentiating the relation ® = 0, one gets 


dx dy 


= + 


VF@) — -VG(y) 


(this being the differential of the first kind on the curve 
® = 0 in the “general case” when F and G are of degree 3 
or 4). We may also say that the isomorphism defined as 
above between the curves 2” = F(x) and w* = G(y) transforms 
the differentials dx/z and dy/w into each other if signs have 
been chosen properly. 

Almost invariably Euler confines himself to the case when 
® is symmetric in x and y; then he writes it as 


a + 2B(x+y) + v(x? +y") + Qdxy + Qexy(xty) + &x*y? = 0 


+ 


and calls it “the canonical equation” (£u.I-20.321 in 
E 347|1765; cf. Eu.J-20.71 in E 251|1753 and Eu.1-20.155 
in E 261|1755). Here F and G are the same polynomial, 
and, in Euler’s language, ® = 0 is an integral of the dif- 
ferential equation 


de _,_4% 
VFX) VFO) 


His main concern is then to determine ® when F is given; 
he does this by the method of indeterminate coefficients, 
which gives him for ® a polynomial containing an arbitrary 
constant, Le., in his language, a “complete” (or general) 
integral for the differential equation determined by F. In 
this he includes the case when F is of degree 2 or 1, even 
devoting a whole memoir to that subject (an elementary 
one, since dx/VF(x) can then be integrated by logarithmic 
or inverse trigonometric functions; cf. Fu.J-20.110-152 = 


+ 
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E 263|1753. In that case, as he shows, it is enough to consider 
relations ® = 0 of degree 2 in x and y, putting e = ¢ = 0 
in the above formula. In doing this he is merely pushin 
further the analogy he had noticed between dx/V1 — x? 
and dx/V1 — x*, an analogy which to us is best explained 
by saying that he is dealing with the various kinds of algebraic 
groups of dimension 1, defined over the field of real numbers 
(the circle being of course one of them). His real concern, 
however, is with the case when F is a polynomial of degree 
3 or 4, and more specifically with the case when F is an even 
polynomial of degree 4: 


F(x) = 1 + mx? + nx’. 


As he perceived, the general case can always be reduced to 
that one; in fact, this can always be done by a suitable 
substitution 


= xt p 
vx + p 


as he once sought to prove (Eu.I-20.303—304 in E 345|1765); 
his proof remained incomplete, since it only established the 
validity of this result over complex numbers, while real 
numbers are clearly what he had in mind. The matter was 
later cleared up by Legendre (cf. his Mémoire sur les tran- 
scendantes elliptiques, Paris, An If [= 1793], pp.9—-10). 

For F(x) = 1 + mx® + nx‘ (or, what amounts to the same 
except in number-theoretical contexts, F(x) = A + Cx? + 
Ex‘), Euler found that one may take for ® an even polynomial 
in x, y (for an @ priori justification of this fact, cf. Appendix 
ITI). Then the method of indeterminate coefficients leads 
directly to the “complete integral” 


0 = c& — x? — 9? + ne®x’y? + QxyV 1 + me? + net 


where c is the arbitrary constant (Eu. 1-20.67 in E 251|1753; 
Eu.J-20.155—159 in E 261/1755; Eu.1-20.310 in E 345|1765), 
and then to the addition formulas (formulas (3) and (4) of 
Appendix II; £u.I-20.69 in E 251; Eu.1-20.159 in E 261; 
Eu.J-20.311 in E 345). 
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In 1765 Euler went one step further, applying the method 
of indeterminate coefficients to the calculation of the “can- 
onical equation” ® = 0 in the general case 


F(x) = A + 2Bx + Cx? + 2Dx? + Ex’. 


From our modern point of view, this amounts to defining, 
in terms of a suitable parameter, the group of automorphisms 
of the curve [ given by y* = F(x). It is now well-known that 
this group can be identified (over the groundfield over which 
I is defined) with the so-called “Jacobian” of I; this is an 
elliptic curve 


Y? = 4X° — ix -j 


birationally equivalent to l over some algebraic extension 
of the groundfield but not necessarily over the groundfield 
itself; 2, 7 are no other than the invariants of the quartic 
polynomial F (cf. e.g. A.Weil, Coll.Papers 11.111-116). Of 
course such concepts lay far beyond Euler’s horizon. It is 
nonetheless remarkable that his calculations (Eu.I-20.321— 
326 in E 347|1765; cf. Eu.1-20.71—76 in E 347|1753) contain 
all the facts which lie behind the above statements, including 
the formulas for the invariants 7, 7 in terms of A, B, C, D, 
E. 

Finally it must be pointed out that in this § we discussed 
only the algebraic aspects of Euler’s work on elliptic integrals. 
Euler himself, however, just as Fagnano before him, always 
put much stress on the relations between integrals which 
accompanied his algebraic results. The arc-lengths of the 
lemniscate and of the ellipse and hyperbola had provided 
him with typical examples of so-called integrals “of the 
first kind” and “of the second kind”, respectively. Eventu- 
ally he extended his investigations even to the most gen- 
eral case of an integral of the third kind, 1.e. one of the 
form J(Z dz)/VF(z), where F is as before and Z is an arbitrary 
rational function of z (Eu.1-21.39-56 = E581|1775). Actually 
all such results are nothing more than special cases of Abel’s 
theorem combined with the algebraic addition formulas (cf. 
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Appendix III), so that we need not enter into that topic 
here. 


SXVI. 


During the years when Euler was pursuing most actively 
the subject of elliptic functions he seems to have left in 
abeyance the topic of diophantine equations of genus | (a 
closely allied one in our eyes). He only came back to it when 
composing the last section of his Algebra of 1770. From that 
time on, diophantine problems, those of genus | and a 
variety of others, were to become one of his favorite pastimes. 

Nine papers on such problems, composed from 1770 on- 
wards, appeared during his lifetime; 24 others, all written 
from 1777 to 1783, were printed posthumously. “I must 
confess”, he wrote in 1772, “that I derive nearly as much 
pleasure from investigations of this kind as from the deepest 
speculations of higher mathematics. I have indeed devoted 
most of my efforts to questions of greater moment, but such 
a change of subject always brings me welcome relaxation. 
Moreover, higher Analysis owes so much to the Diophantine 
method, that one has no right to dismiss it altogether” 
[“... fatert cogor me ex huiusmodi investigationibus tantundem 
fere voluptats capere quam ex profundissimis Geometriae sublimioris 
speculationibus. Ac st plurimum studi et laboris impendi in quaes- 
tionibus gravioribus evolvendis, huiusmodi variatio argumenti 
quandam mihi haud ingratam recreationem affere solet. Ceterum 
Analysts sublimior tantum debet Methodo Diophanteae, ut nefas 
videatur eam penitus repudiare”: Eu.1-3.174 in E 427|1772]. 

At first sight these papers, which take up so much space 
in volumes III, IV and V of Euler’s Opera Omnia, have the 
appearance of a collection of difficult exercises, more or 
less haphazardly assembled. One can admire Euler’s vir- 
tuosity in performing the most abstruse calculations, or rather 
(since he was practically blind at the time) in directing his 
assistants’ handling of them, without feeling tempted to 
scrutinize them in detail. On closer inspection, one notices 
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Euler’s continuing preoccupation with equations of genus 
1, to which he had already reserved a place of honor in his 
Algebra (cf. §Vd). In E 405|1770 and E 451|1772, for instance 
(Eu.1-3.148-171 and 282~296), some problems of little in- 
trinsic interest are solved by means of equations of the form 


y? = A+ Bx + Cx? + Dx’ + Ex! 


where A and E are squares; on each occasion he describes 
anew Fermat’s methods for finding infinitely many solutions 
of such equations (cf. Chap.II, §XV, and Euler’s Algebra, 
Eu.1-1.396), calling them “the usual methods” (“methedos 
consuetas”, Eu.J-3.158); perhaps his purpose there was merely 
to teach those methods to his assistants. He takes up re- 
peatedly two of Fermat’s most notable problems (problems 
(A) and (B) of Chap.II, §XV; cf. ibid., §X VI and Appendix 
V), amounting, as we have seen, to the equations 2X 4{_yta 
+Z’, or also (as Euler, following a hint from Leibniz, 
prefers to formulate it) to the pair of equations X + Y = U?, 
X®? + Y? = V‘*; in 1773 he outlines a solution by descent 
(Eu.1-4.96-98 = E 560!’, published in 1783), thus antici- 
pating to some extent Lagrange’s definitive treatment of 
the subject in a paper of 1777 (Lag.IV.377-398); in 1780, 
quoting Lagrange, he seeks to improve still upon his own 
and Lagrange’s treatment, but, failing to use descent, his 
new proof remains incomplete (Eu.I-5.77—81 = E 769). 

In 1777, and again in 1782, he seeks criteria for an equation 
X* + mX*Y? + Y* = Z?, and for a “double equation” 


X?4+ ¥? = Z?, X24 NY? =T?* 


to have infinitely many solutions (Eu.I-4.235-244 = 
E 696|1777; Eu.1-5.35-47 = E 755|1782; Eu.I-4.255-268 
= E 702|1777); not surprisingly, he fails (it is still an open 
problem) but does obtain non-trivial sufficient conditions. 
In 1780 he extends the latter question to double equations 


X? + mY? = Z°, X? + nY? = T°? 


(Eu.1-5.48-60 = E 758|1780); here his interest and our 
own turns to proofs of impossibility for the cases m = 1, 
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n = 3 or 4 and others equivalent to these two; one may 
well admire his beautiful technique (rather different from 
Fermat’s) for applying the infinite descent to such problems. 

More important to us are some papers, written in1780 
(Eu.1-5.82—115, 146-181 = E772, 773,777, 778), the first 
one of which bears the characteristically enthusiastic title 
“De insigni promotione Analysis Diophanteae” (“On some famous 
progress in Diophantine Analysis”: E 772). Along with a 
number of other papers by Euler on various subjects, they 
were all published in 1830 in volume XI of the Mémoires de 
VAcadéme des Sciences de Saint Pétersbourg (which had changed 
their title from Latin to French in 1809). It is this publication 
which gave Jacobi the occasion for writing his note of 1835 
with the no less characteristic title “De usu theoriae integralium 
ellipticorum et integralium abelianorum in analyst diophantea” 
(“On the use of elliptic and abelian integrals in diophantine 
analysis”: Jac.11.53—55; cf. Chap.II, §XVII], and for ex- 
pressing the view that Euler must have been aware of the 
connection between diophantine equations of the 
form y’ = F(x), with F of degree 3 or 4, and the elliptic 
integral fdx/VF{(x). 

To us it is indeed obvious that the addition formulas 
(formulas (3) and (4) of Appendix ITI) determine a rational 
solution (x’, y’) of the equation y’> = 1 + mx® + nx* when 
two such solutions (x, y), (a, 6) are known; it would indeed 
be surprising if this had never occurred to Euler as he re- 
peatedly wrote those same formulas and the multiplication 
formulas derived from them (cf. Eu.I-20.67,159,311, and 
above, §XV). Actually he gives no hint that he perceived 
such facts; luckily his papers of 1780 give some clue to his 
way of thinking. 

As we have seen in §XV, his way of dealing with elliptic 
integrals had been entirely based on his “canonical equations” 
of the form P(x, y) = 0, with ® of degree 2 in x and 2 in y. 
Now his “famous progress” in the paper E 772 consisted 
precisely in the introduction of such equations as models 
for elliptic curves, to replace the equations and “double 
equations” previously in use. That he did so in full awareness 
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of the connection between the two topics seems proved by 
his use of the same name (the “canonical equation”) for this 
equation ® = 0 in the new context (£u.1I-5.158 in E 778) 
and by his using for it precisely the same notations (Eu.I- 
5.153 in E 777, to be compared with Eu.1-20.321). 

To our eyes it is the perception of this connection (later 
made explicit by Jacobi) which was Euler’s “famous progress”, 
and it is something of an anticlimax to consider his further 
steps. Of course the question arises how to obtain a “can- 
onical” model (if one exists) when the problem is given 
otherwise. Algebraic geometry supplies a criterium for this; 
for a given curve I of genus 1, there is a model (x, y) = 0, 
with ® of degree 2 in x and 2 in 9, if and only if there are 
on I’ two non-equivalent rational divisors of degree 2 (cf. 
Chap.II, Appendix IT). For Euler, the curve is given by an 
equation y* = F(x), and the question, which he can only 
solve by trial and error, is to find polynomials P, Q, R of 
degree 2 such that F = P* + QR, after which he can take 


P(x, y) = Q(x)y* + 2P(x)y — R(x) 


(Eu.1-5.158). Once this is done, he uses the equation ® = 0 
to perform what we called “the ascent” (cf. Chap.II, 
§XV), ie. to construct new solutions out of any that may 
already be known, rightly claiming the new method to be 
far more efficient than the traditional one, which, as he 
says, not only “requires most tedious calculations” but “soon 
produces such enormous numbers that hardly anyone would 
undertake so much labor” [“‘nimis taedzosas ambages requirit 

. mox ad tam enormes numeros pervenitur, ut vix quisquam 
tantum laborem suscipere voluerit”: Eu.1-5.82 in E 772]. His 
procedure (an example of which has been given above in 
Chap.II, Appendix V) is as follows. 

As before, we can write: 


(x, y) = Po(x)y? + 2Pi(x\y + Po(x) 
= Qoly)x” + 2Qi(y)x + Qo(y) 


where the P;, Q; are polynomials of degree 2. If M = (x, y) 
is a solution of ® = 0, there is another one N = (x, y’), 
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where y’ is given byy + y’ = — 2P,(x)/Po(x), or alternatively 
by yy’ = Po(x)/Po(x); then there is a solution M’ = (x’, y’) 
given by x + x’ = —2Q)(y’)/Qo(y’) or xx’ = Qo(y')/Qo(y’). 
Repeating this process, one gets thus an infinite sequence 
of solutions M, M’', M", M", etc., which in general are all 
different. Another similar sequence can of course be obtained 
by exchanging the roles of x and y in this procedure. 

To understand what goes on here, it is best to use the 
concepts explained in Chap.II, Appendix IJ. On the curve 
of genus I defined by ® = 0, let A, A’ be the points where 
x = and B, B’ those where y = ©; puta = A + A’, 6b = 
B + B’'; these are the divisors of poles of x and of y, re- 
spectively; therefore they are rational. With the above no- 
tations, x takes the same value at M and at N, so that we 
have M + N ~ a; similarly we have N + M’ ~ b. Therefore, 
putting m = b — a, we have M’ ~ M + m, and similarly 
M" ~ M + 2m, M" ~ M + 3m, etc. The modern theory 
of heights shows that the height of the n-th point of this 
sequence, i.e. the size of the integers needed to express its 
coordinates (cf. Chap.II, Appendix IV), is of the order of 
magnitude of n’, in contrast with the points produced by 
the “traditional method”, whose heights increase exponen- 
tially since those methods rest upon the duplication formulas. 
This accounts for Euler’s findings. 


SXVII. 
[For brevity we will write ¢(s) for the series 2 in‘ or rather, 
as Euler would put it, for 
p¢te tats 
a er al 
L(s) for 38 (— 1)"(2n4 1)™ or 


and n! for 1.2.3...n]. 
At the time of Euler’s apprenticeship under Johann Ber- 
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noulli, the summation of the series ((n), when mis an integer 
22, was a classical problem which had exercised Leibniz 
and the Bernoulli brothers (cf. Corr.II.15|1737). The sum- 
mation of L(1) in 1674 had been perhaps the most striking 
early discovery of the young Leibniz (one which, as Huygens 
predicted, would be “for ever famous among geometers”: 
Huy. V11.394| 1674), but it depended upon the power-series 
expansion 
_x x n x 
arc tan x = 5 3 5 , 

which gives L(1) = 7/4; this could not be extended to ¢(n) 
nor to L(n) for any n > 1. Even the numerical evaluation 
of these series was no mean problem, in view of their slow 
convergence. 

In 1728 we find Daniel Bernoulli writing to Goldbach 
that the sum of the series ¢(2) is “very nearly 3” 
(Corr.11.263)1728); Goldbach, using an elementary method, 
answers that ¢€(2) — 1 lies between 16223/25200 and 
30197/46800 (and thus between 0,6437 and 0,6453: 
Corr.11.282|1729). At that time Euler was already in Pe- 
tersburg, and in daily contact with Daniel; he must have 
been aware of those letters. Soon he was presenting to the 
Academy a paper ending up with a much better evaluation 
for €(2), viz., 1,644934, obtained by an ingenious application 
of integral calculus (Eu.1-14.41 in E 20/1731). 

There can be no doubt that these and similar problems 
supplied Euler with a powerful motivation in his discovery 
of the so-called “Euler-MacLaurin summation formula” (it 
was published independently by MacLaurin in his Treatise 
of Fluxions, Edinburgh 1742). This is already stated in a 
paper presented to the Academy in 1732 (Eu.I-14.42— 
72 = E 25); three years later he gave a full exposition of 
the same topic (Eu.I-14.108-123 = E 47/1735; cf. Eu.I- 
14.435~—439 in E 130|1739, and Chap.V of his Institutiones 
Calculi Differentialis of 1755, Part Il, Ewu.1I-10.309-336). His 
procedure may be summarized as follows. 

The problem, as he sees it, is to find a formula for a sum 
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2f(i) taken over the range 1 <i <n, f(x) being what we 
should call an analytic function of x for all x > 0; from this 
is to be derived the value of X{f(2). The answer is sought in 
the form S(n), S being also an analytic function for x > 0. 
Clearly S will have the required property if it satisfies the 
difference equation 


S(x) — Se—1) = fx) 


and if S(0) = 0, or at any rate if it satisfies S(v) = 2if(i) for 
some integer v. As to the difference equation, Euler, using 
Taylor’s theorem (for which he refers to Taylor’s publication 
of 1715, although he must have known that it had been 
obtained previously by Leibniz and by Johann Bernoulli; 
cf. Eu.1-14.109), rewrites it in the form 


dS 1 d’s 1 ds 
IO Ge de Bae 


x 


and seeks to solve it by putting 


d d® 

S(x) = a fx + Bf + yb + s+ etc. 
with indeterminate coefficients a, B, y, etc.; these are to be 
determined by substituting back this tentative formula into 
the preceding equation and identifying both sides. ‘The ad- 
ditive constant implicit in the indefinite integral [fdx can 
then be determined by taking for x any integral value, e.g. 
x = 0. This gives him at once the values of a, B, y, etc. by 
means of a recurrence formula; he gets a = 1, B=1/2, 
then for instance 


etc., and hence (but “with a great deal of work”, “multo 
labore”: Eu.I-14.436) the numerical values for the first four- 
teen of those coefficients; those of even order, other than 
B, all turn out to be 0. 
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In this Euler says that he had proceeded according to “a 
known method” (“methodo cognita”; Eu.J-14.112), by which 
he probably meant no more than the “method” of indeter- 
minate coefficients. Actually he had been bold to the point 
of rashness; but, as the Latin proverb says, “audaces fortuna 
iuvat” (fortune favors the bold). Formally speaking, his pro- 
cedure can be clarified by using the operator D = d/dx; 
then one may write: 


S(x—1) = e~?S(x), flx) = S(x) — Sie-1) = (1—-e7”)S(x) 
and, formally of course, 
S(x) = (1-e7”) fx) = (aD7'+ B+ yD + 8D* + -)f(x) 


where a, 8, y, etc., appear now as the coefficients of the 
Taylor expansion of z/(1—e*) at z = 0 (Ew.1-14.436 in 
E 130)1739). From this fact, which had escaped Euler’s notice 
at first, one deduces easily that the coefficients of the even 
powers of D in the above formula are all 0, as his previous 
numerical computations had indicated (cf. zbid.). If now one 
puts 


“1 
i = 2 gm? 


one gets by = 1, b; = 3, ben; = 0 for m > 0; as to the bon 
for m > 0, they have the successive values 


l 1 1 1 5 691 


6’ 30°42’ 30°66’ _2730° “S 


and Euler’s result, formally speaking, amounts to the formula 
(EM) S(x) = > f@ + [ fd 
i=1 


+ Doh lp Ma) fer ve] 


where the f”, for i = 0, denote f and its successive derivatives. 
As Euler observed at once (Eu.I-14.43—44 in E 25/1732; 
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Eu.J-14.115-117 in E 47/1735), this result is valid at any 
rate when f(x) is a polynomial in x. Then the series given 
above for S(x) reduces to a finite sum and defines a poly- 
nomial which is a solution of the difference equation 
S(x) — S(x-—1) = f(x). For instance, taking f(x) = x’, r 2 1, 
and v = 0, one gets: 


I+ OQ + + nl = S,(n) 


where S, is the polynomial given by 


r+] r 1 
Sx) = * + = + > (3s, i Dom xve to em 


r+1 2 rHd 


(Eu.I-14.116-117), or more simply by: 


dS,(x) _ _ 4. f(r rom _ 
eo Bile) = S (7) , §,(0) = 0. 


m=0 


This result had been anticipated, partly by Fermat (cf. above, 
Chap.II, §11), and more fully by Jacob Bernoulli (cf. pp.96— 
97 of his Ars Conjectandi published posthumously in Basel 
in 1713); the latter’s priority was eventually perceived by 
Euler, who then adopted A. de Moivre’s designation “Ber- 
noulli numbers” for the numbers b»,, (cf. A. de Moivre, 
Miscellanea Analytica, Londini 1730, Complementum, pp.6 
and 19-21): “these are the numbers”, Euler wrote, “that 
are called Bernoullian from their inventor Jacob Bernoulli, 
since he used them in his Ars Conjectandi to sum the powers 
of natural integers” [“numeri ab inventore Bernoulliant vocatt, 
quippe quibus olim Jacobus Bernoulli in Arte Conjectand: est usus 
ad progressiones potestatum numerorum naturalium summandas”: 
Eu.1-15.92 in E 393|1768; cf. the similar comment in his 
Institutiones Calculi Differentialis of 1755, Eu.I-10.321]. 
What Euler had chiefly in mind at this juncture, however, 
was the numerical evaluation of such series as ¢(n) for n = 
2, 3, 4, etc., and also, to begin with, of the partial sums of 
the series €(1). In such cases a modern analyst would say at 
once that the summation formula (EM) breaks down because 
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(as Euler himself did not fail to discover: Eu.I-14.357 in 
E 125/1739; cf. Eu.1-14.119 in E 47|1735 and, in the Insti- 
tutzones Calculi Differentialis, the passage Eu.I-10.327) the 
numbers |bz,,| increase so rapidly that the series involved 
are always divergent. The correct way to use them for pur- 
poses of numerical computation is to cut them down to 
finite sums and evaluate the remainder. Instead of this, 
Euler’s prescription is to sum the terms “until they begin 
to diverge” (“quoad termini divergere incipiant”: Eu.1-14.357); 
perhaps one should rather say that he allows himself to be 
guided by his instinct (not a bad guide, truly, in such matters). 
It is thus that he is able to compute €(2) with 20 decimal 
figures: 


£(2) = 1,64493406684822643647... , 


¢(3) with 15 figures, €(4) with 16 figures, the so-called “Euler 
constant” with 16 figures (Ew.I-14.119—122 in E 47|1735), 
then 7 with 15 figures (Eu.1-14.359 in E 125]1739); eventually 
he used the same method to compute ¢(n) with 18 figures 
for 2 < n < 16 (Corr.1.207|1743; cf. Eu.I-14.440 in 
E 130)1739, and Chap.XI of the Introductio of 1748, Eu.1- 
8.201—205). There can be no doubt that he took much 
pleasure in such calculations, even apart from the theoretical 
conclusions which he was sometimes able to derive from 
them. 


SXVIII. 


“So much work has been done on the series {(n)”, Euler 
writes in 1735, “that it seems hardly likely that anything 
new about them may still turn up ... I, too, in spite of 
repeated efforts, could achieve nothing more than ap- 
proximate values for their sums ... Now, however, quite 
unexpectedly, I have found an elegant formula for ¢(2), 
depending upon the quadrature of the circle [i.e., upon 7]” 
[“Tantopere iam pertractatae et investigatae sunt series reciprocae 
potestatum numerorum naturalium, ut vix probabile videatur de 
us novi quicquam inveniri posse ... Ego etiam iam saepius . . . 
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has series diligenter sum persecutus neque tamen quicquam aliud 
sum assecutus, nisi ut earum summam . . . proxime veram definiverim 
... Deductus sum autem nuper omnino inopinato ad elegantem 
summae huius seriec 1 + 4 + § + de + etc. expressionem, quae 
a circuli quadratura pendet”: Eu.1-14.73—74 in E 41|1735]. 

When he wrote these lines, he had obviously just discovered 
his famous result {(2) = 7°/6. His paper on that subject, 
and on various closely related summations, was presented 
to the Academy on 5 December 1735; it was not printed 
until 1740 (Eu.I-14.73-86 = E 41), but by that time everyone 
who counted in the international mathematical community 
had been apprised of it: first of all, perhaps, Euler’s friend 
Daniel Bernoulli in Basel (cf. Corr.11.435|12 Sept. 1736 and 
Corr.I1.15—16|1737); also STIRLING in Edinburgh (Eu.IV 
A-1, no.2621]8 June 1736), EHLER and KUHN in Dantzig 
(PkU.345|3 April 1736), POLENI in Padova (PAU.216|13 
March 1736), NAUDE in Berlin (PkU.193—199|1740), and, 
directly or through these, many others including of course 
Johann Bernoulli (cf. Corr.11.15—16|1737). As to Goldbach, 
he was in Petersburg at the time of the discovery and surely 
must have heard of it at once. In 1742, noticing (not without 
a shade of disappointment) that CLAIRAUT was insuff- 
ciently informed of the matter, he promptly sends him full 
details, prefacing them with the words: “M’ Jacq. Bernoulli 
... parle de ces suites, mais ... tl avoué, que malgré toutes les 
peines, qu s’etort données, il n’avoit pa venir a bout, de sorte que 
M” [Jean] Bernoulli, de Mowvre et Stirling, grands Maitres dans 
cette matiere, ont été extrémement surpris, quand je leur annongors 
que j avois trouvé la somme de cette serie 

1 1 I 


1 
I+atotiet ont et 


et méme de celle-ci 


quand n est un nombre pair” [“Jacob Bernoulli does mention 
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those series, but confesses that, in spite of all his efforts, he 
could not get through, so that Joh. Bernoulli, de Moivre 
and Stirling, great authorities in such matters, were highly 
surprised when I told them that I had found the sum of 
¢(2), and even of ¢(n) for n even”: Fu.ITV A-5.120]. These 
are almost the same words which Fermat had once used in 
describing one of his own discoveries (“c’est une de mes in- 
ventions qui a quelquesfois estonné les plus grands maistres”: cf. 
Chap.II, $XV). From 1736 onwards, and for the next eight 
or ten years, Euler’s result, and the degree of validity of his 
proof for it, become a recurrent topic in his correspondence 
with his friends and colleagues all over Europe. 

Here again Euler had been rash, applying Newton’s theo- 
rem on the sums of the n-th powers of the roots of algebraic 
equations to transcendental equations of the form 
y = sin s, where y is given and s is the unknown. Using the 
power-series for the sine, and calling A, B, C, etc., the roots 
of the equation, he writes it as 


3 5 
RY s 


S 
+ 
y  1.2.3.y  1.2.3.4.5.y 


-(ea)la}leaben 


and concludes from this, firstly, that 


0=1 


+ etc. 


i tolokg 
y A BC oO” 


then, comparing terms in s”, that 


0) = i + + _l toda, ° _l Iii 
~ AB 2\A B Q\ A? B? , 


which gives %1/A® = 1/y’, and similarly for higher powers. 
Taking in particular y = 1, he states that the roots A, B, C, 
etc., are g, g, —3q, — 3q, 5q, 5g, etc., where g is one fourth 
of the circumference of a circle of radius | (i.e. 7/2 in the 
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notation he was soon to adopt). This gives: 
11 
jaa 2(ietebn) 
ts 


which was Leibniz’s result, and further: 


2\? 1 1 
1=2 x (2) (isk+he~) 


It was of course obvious to Euler that one has 
1 1 1 1 1 l 1 
V+ gett = + opt gst - ep tptet™ 


1 1 1 3 
= (d+f+-}(1-3) = qol2)» 


so that the above conclusions give £(2) = 7°/6. The same 
procedure, pushed further, gives to Euler (somewhat la- 
boriously, “multo labore”: Eu.I-14.85) the values of ¢(n) for 
n = 4,6, 8, 10, 12, and those of L(n) for n = 3, 5,7. Applied 
to other values of y than y = 1, it gives the value of still 
other series (Dirichlet series in modern terminology). All 
this is rapidly sketched in Euler’s article of 1735; one can 
almost feel in it the feverish excitement of the writer. 

Euler’s method had been open to grave objections, as 
he perceived himself (cf. Eu.I-14.79 in E 41|1735, and 
Eu.J-14.139-140 in E 61/1743) even before they were 
pointed out by his correspondents: Johann Bernoulli in 
1737 (Corr.11.16-17), as well as Daniel Bernoulli (cf. 
Corr.11.477|1741), Daniel’s cousin Nicolas Bernoulli (cf. 
Corr.11.683]1742), Cramer in Geneva (cf. Corr.11.477,683) 
and perhaps others. Apart from the question of the con- 
vergence, or possible divergence, of the series and products 
involved (a question raised, somewhat clumsily, by Nicolas 
Bernoulli: Corr.11.683—684 and 691]|1742), one could well 
ask whether the transcendental equation 
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had not, besides its “visible” real roots, other “imaginary” 
roots which would vitiate the whole calculation (Corr. 11.16; 
cf. Eu.IV A-5.121); nor was this objection limited to the 
possible existence of roots of the form a + 6V—1, even if 
this could have been meaningfully asked before Euler de- 
fined the sine for such values of the argument (cf. infra, 
§XIX); for the so-called fundamental theorem of algebra, 
which at least had the status of a plausible conjecture for 
algebraic equations (cf. above, § VII) could not even be rea- 
sonably formulated for equations “of infinite degree” such 
as the one Euler was trying to handle. More specifically, it 
was objected to him that if an ellipse was substituted for 
the circle in the geometric definition of the sine, his con- 
clusions would become obviously false, while the visible real 
roots of his equation would remain the same (Corr.11.477). 

At first Euler held his doubts in abeyance by observing 
that his method agreed with Leibniz’s result in the case of 
L(1) (so that, as he says, if the equation 1 = sin x had “im- 
aginary roots” besides the visible real ones, at least the sum 
of their reciprocals would be 0), and also by comparing his 
new values for (2), £(4), etc., with the numerical values he 
had computed previously (cf. above, §X VII). Thus, he says, 
“I did not hesitate to publish these sums as being perfectly 
true”(“non dubitavi istas summas tanquam verissimas producere”: 
Eu.1-14.140 in E 61/1743). 

Nevertheless, for the next ten years, Euler never relaxed 
his efforts to put his conclusions on a sound basis. At first 
this led him only to an analytical proof for (2) = 7/6 (or 
rather for the equivalent result 1 + 1/3 + 1/5? + ++ = 
m*/8) which did not seem capable of being extended even 
to ¢(4) (cf. his letter of 1737 to Johann Bernoulli, 
Bibl.Math.{111) 5 (1904), pp.258—259, and P. Stackel’s article, 
Eu.I-14.156—176). But now his attention had been drawn 
by his heuristic proof of 1735, as well as by his earlier work 
on what is now called the gamma function (cf. Corr.1.3— 
6|1729, etc.) to infinite products in their relation to infinite 
series, and this, too, led him to a momentous discovery. In 
a paper presented to the Academy in 1737 (Eu.I-14.216—- 
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244 = E72), after discussing some rather queer series sug- 
gested to him by Goldbach, he goes on to consider some 
infinite products “no less admirable”, he says, “since in them 
the factors proceed according to the prime numbers, whose 
progression is no less irregular {than the terms in Goldbach’s 
series]” (“.. . neque minus erunt admirabilia... cum... in his 
... termini progrediantur secundum numeros primos, quorum 
progressio non minus est abstrusa”: Eu.1-14.227). What he means 
by this is the famous “eulerian” product for ¢(s): 


us) = T]a-pyy7! 
p 


extended to all primes p, or in his notation 


i¢4ty 354g ee ete 
gr gn" gr  “(2" 13" = 15" — D7" — 1). ete. 

(Eu.I-14.230), and the similar product 

T l- i. 1 Ly 1 tc. = 3-5-7-11-13-etc. 

4 3°57. 9 SO ~ 4.4-8-12-12-etc. 
or in modern notation 

p 
LQ) = 


where the sign is determined by p + 1 = 0 (mod 4) (Eu.I- 
14.233; cf. Corr.1.577—578|1752). From the former result 
he deduces that the sum & 1/p is infinite “as the logarithm 
of the sum of the harmonic series > 1/n”; as he knew that 
the sum =¥ 1/n is of the order of magnitude of log N (cf. 
e.g. Eu.1-14.87-100 = E 43]1734), this meant that the sum 
Xp<v 1/p has the order of magnitude of log log N. From 
the result about L(1) he was later to conclude that the sum 
X»+2 +1/p (where the sign is determined as before, and 
the odd primes are arranged in increasing order) has a 
finite value, approximately 0,334980 (Corr.1.587|1752; cf. 
Eu.1-4.146—153 in E 596|1775). From these observations it 
follows obviously that the sum 21/p extended to the primes 
of the form 4n + I, and the similar sum extended to the 
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primes of the form 4n — 1, are both infinite and of the 
same order of magnitude. 

Some years after thus introducing “eulerian products”, 
Euler devoted to the same topic a whole chapter of his 
Introductio in Analysin Infinitorum (Chap.XV = Eu.I-8.284— 
312), where he not only writes the products for ¢(s) and 
L(s) in almost modern form, writing for instance for ¢(n) 
the following expression 

l 


but extends this to what we would now call L-series with 
Dirichlet characters modulo 3 (Eu.I-8.308—309) and modulo 
8 (Eu.I-8.310-311). He even shows how such series (and 
“innumerable others”: Eu.I-8.306) can be summed for 
n = |, using the results of the preceding chapter X (cf. 
infra, §X1X). One may well regard these investigations as 
marking the birth of analytic number-theory. 


§XIX. 


In 1735 Euler had been unable to prove his results on 
¢(2n) to his own and his colleagues’ full satisfaction. In 1745 
he had already completed in manuscript his Introductio in 
Analysin Infinitorum, presenting in its Chapters VIII to XI 
a masterly exposition of a theory of trigonometric functions 
and their series and product expansions (Eu.I-8.133—212), 
with the results in question as one of its main applications. 
Some of it had been published in Berlin some time before 
(Eu.I-14.138-155 = E 61{1742); most of it had apparently 
been discovered in the first year or two of his stay in Berlin 
(cf. Ewu.IV A-5.115,120—-124]1742 and Corr.1.131—132|1742). 

Since his younger days he had known that e* is the limit 
of (1+x/n)” for n > & (a fact which he chose to express by 


writing 
x n 
C= (: + :) 
n 


“for n infinite” (“exestente n numero infinito”: Eu.I-14.143 in 
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E 61|1742; cf. his Introductio, Eu.1-8.132, 147, etc.). He was 
also familiar with the power-series for e¢*, sin x, cos x; in 
particular, he knew that the power-series for e* can be ob- 
tained by expanding (1 +.x/n)" by the binomial formula when 
n is taken to be a (finite) integer, and then taking the limit, 
term by term, for n — © (or, as he preferred to put it, 
substituting for n an infinite number: Fu.I-8.123-124). 

In 1740, in an exchange of letters with Joh. Bernoulli on 
the solutions of linear differential equations with constant 
coefficients (among other topics), he stated for the first time 
the celebrated formula 


Qcosx =e? ' + eve! 


> 


deriving it from the identity between the power-series ex- 
pansions of both sides (Bibl.Math.(I1D) 6 (1905), pp.76—77); 
implicitly he was using the power-series for e* in order to 
extend its definition to complex values of the argument. 
The similar formula for sin x must of course have been 
discovered at the same time (cf. Ewu.J-14.142,144 in 
E 61|1742). Combining this with the above formula for e”, 
Euler could then conclude that sin x is the limit for n — 
(or, as he said, “the value for n infinitely large”) of the 


polynomial _ 
_ 1 xV—1\" _ xv 1\" 
P(x) = aT — (: —— | 


Now P,, (for finite n, as he would say) can easily be split into 
its real factors of degree | and 2 (cf. above, §VII). For an 
odd n = 2p + 1, this is effected by using the identity 


p + . 
x" _ y” _ (X—Y) I] (X — et 27/7y) (X —e7?m/ny) 
v=] 
p QTV 
= (X-Y) [] [X?-2XxY cos —+Y?*}. 
v=l n 
This gives, again for n = 2p + I: 


Qav 
1 + cos— , 
x 


p 
P,(x) = x I] Qilx), Qrlx) = 1 - 
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Clearly, for any given v, and “for n infinite”, Q,,,(x) becomes 
1 — x*/v?r*; from this Euler draws the conclusion 
% 2 
. x 
sinx = x {1--z4], 
11 (1-3) 


and a similar calculation gives him the corresponding infinite 
product for the cosine: 


= 4x? 
cos x = a (: "Qr4 le ==! 


(Eu.1-14.141~146 in E 61/1742; cf. the Introductio, Chap. 
IX = Fu.J-8,153-176), showing, in his own words (Eu.I- 
14.146), that the equations of infinite degree which he had 
previously applied to the calculation of the series (2), £(4), 
etc., “have indeed no other roots” than the visible real ones. 
As noted above, all this is expressed by Euler in the language 
of “infinitely large” and “infinitely small” numbers (a lan- 
guage that has been revived in our days by the adepts of 
so-called “non-standard analysis”). The evaluations which, 
from our present point of view, would legitimize his rea- 
sonings are of course lacking; but in the present case any 
tiro could easily fill them in. 

From these infinite products, as Nicolas Bernoulli rightly 
observed (Corr. 11.689] 1742; cf. ibid.,p.694), one deduces at 
once, by logarithmic differentiation, further formulas, such 


as 
WT COS 1X 1 = I 1 
sin 7x x vp+x vy-x 


which Euler writes as follows: 


COS 7X d } 1 1 1 


7 + etc. 
sin 7x x }~x lt+tx 2-x 24+x 3-x«x 


(without the parentheses which in our eyes are to guarantee 
the absolute convergence of the series). Combining this with 
the formula which can be similarly derived from the product 
for cos x, one obtains, in Euler’s notation 


7 1 1 1 I 1 1 
- + + —— + — - 
sin 7x x }-x l+x 2-x 2+x 3-x 3+x 


— ete. 
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(cf. e.g. Fu.1-14.148). For this Euler has two other proofs, 
one based on integral calculus (Eu. I-14.146—148 in E 61/1742; 
cf. his letter to Clairaut, Eu.IV A-5.115]1742), the other 
based directly on the extension of the above infinite products 
to functions of the form cos x + cos a, sinx + sina (Eu.I- 
8.170—176, 184-187 in Introductio, Chap. X—X). From this 
it is easy to derive by repeated differentiation not only the 
values of ((n) for even n > 0 and of L(n) for odd n > OQ, but 
also that of infinitely many other “Dirichlet series” (as they 
would now be called) (cf. e.g. Corr.1.132|1742 or Eu.1-14.151- 
152). In particular one finds that €(n) is of the form 7(27)" 
for every even n > Q, and L(n) of the form s(27)” for every 
odd n > 0, the numbers 7, s being rational. 

At first sight there is no connection between those results 
and the Euler-MacLaurin formula. Euler, however, could 
not but notice fairly soon the relation between the values 
of ¢(2n), which he had calculated already in 1735 up to 


6917"? 
g(12) = 6825 x 93555 


(Eu.1-14.85 in E 41|1735), and the numbers 65, (not yet 
called by him Bernoullian) which had turned up in his work 
on the summation formula (cf. Eu.I-14.114 in E 47|1735); 
in particular the occurrence of the unusual prime 691 (a 
kind of “tracer” for Bernoulli numbers) both in ((12) and 
in by, could not have failed to strike him. Once this had 
been observed, Euler had no difficulty in establishing the 
general relation 


_ L _ n— 1020 Qn 
U(2n) = 5(- D'S 2m 


(Eu.I-14.434—439 in E 130]1739). 


§XX. 


The mystery of the values of Cn) for odd n > I still 
remained intact; indeed it has remained so down to the 
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present day. Knowing of course that ¢(s) is related to the 
alternating series 


1 
os) 1-24 3-1 ete 


by the formula 
ols) = (1-2)e0, 


Euler apparently thought of clearing up the mystery by first 
investigating the values to be attributed (according to his 
views on divergent series: cf. Eu.I-14.585-617 = E 247|1746, 
and the passage Eu.I-10.81—-82 in his Institutiones Calcul 
Differentialis of 1755) to the divergent alternating series 


g(—m) = 1 — 2" + 3" — 4" + etc. 


for odd values of m > 0 (cf. Eu.J-14.443 in E 130]1739). To 
him this could be done by taking g(—m) = R,,(1), where 
R,, is the rational function defined (for |x| < 1, as one would 
insist on saying now) by the series 


R,(x) = 1 — 2" +3"x? — 4"x° + etc. 


“Differential calculus”, as he says, “gives us an easy way of 
summing such series” (“Le calcul différentiel nous fournit un 
moyen fort aisé de trouver la somme de ces séries”: Eu.1-15.71 in 
FE 352|1749]. In fact, one has Ro(x) = 1/(1+2), and, for 
every integer m = 0: 


Rnsi) = <eR.()], 


so that, for m > 0, R,, is of the form 


PlX) 


R,,(x) = (L+x)"*! 


where P,, is a polynomial of degree m — 1 with integral 
coefficients. Moreover, by writing down P,, for a few values 
of m, as Euler must have done, one notices at once that R,, 
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satisfies the condition 
ra(2) = (— D's Ryle), 


a fact which can then be verified easily by induction on m. 
Thus or otherwise Euler found that g(—m) = R,,(1) = 0 
for every even m > 0 (Eu.I-14.442 in E 130|1739). 

A far more surprising result comes out when one computes 
even the first few values of g(—m) for odd m = 1; when 
Euler did this, it must have become immediately clear to 
him that those values are related to those of (m+ 1), which 
in turn he had found to be related to the Bernoulli numbers 
bn+1- Actually the relation between them can be written as 


b 
_ = = m+\_ mt+t 
elm) = Ra(1) = 2" = 1) Bt, 
this being valid for all odd m > 0, and also for all even 
m > 0 since then both sides are 0. In 1739, Euler, having 
made the computation for m = 1, 3, 5, 7, chooses to write 
the result as 


1 — 2" + 3” — 4” + etc. 


+ 2-1-2-3---m 1 1 
= oe (+ gnri + omit etc.) 
with alternating signs +, —, +, — form = 1, 3,5, 7; here 


the series in the right-hand side has clearly the value 
(1-2°>""")&(m + 1). 

There Euler let the matter rest for the next ten years, 
until at last he looked for a general proof and of course 
found it, using, as he said, “a most peculiar method” [“// 
faut employer une méthode toute particuliére pour démontrer cette 
harmonie”: Eu.1-15.75 in E 352|1749]. The “peculiar” method 
consisted in a wholly illegitimate application of the Euler- 
MacLaurin formula to the series g(—m), so hazardous that 
there would be little point in describing it here; even Euler 
might have hesitated to put it forward, had he not had two 
other methods at his disposal for checking the results in 
each specific case, one of these being the one indicated above; 


Euler 275 


for the other, cf. Eu.1-14.594—595 in E 2471/1746, and Eu.I- 
10.222-224 in Part I], Chap.I, of the Institutiones Calculi 
Differentialis. Anyway he did obtain the correct value as shown 
above, thus verifying the validity of the above formulas for 
all odd m > 0. 

This time, however, Euler did not stop there. Substituting 
n— 1 for m, and rewriting the result in terms of the function 
y, one gets 


gil=n) 5 1:23... (a= DQ"=)) 
p(n) " (2""'—1)n" 


with C, = (—1)"”, this being valid for all even n > 0. As 
Euler observes, the left-hand side has the value 0 for all 
odd n > 1, so that the formula is also valid for those values 
of n provided one takes then C,, = 0. 

Ever since his early days in Petersburg he had been in- 
terested in the interpolation of functions and formulas given 
at first only for integral values of the argument; that is how 
he had created the theory of the gamma function (cf. 
Corr.1.3—4|1729). Not surprisingly, he sought to do the same 
for the above formula by substituting cos wn/2 for C,, and 
the gamma function, i.e. I'(n) in the notation later introduced 
by Legendre, for (n—1)!. The result is equivalent to what 
we know as the functional equation of the zeta function. In 
view of what Euler knew about the gamma function, it holds 
for all integral values of n, positive or negative (including 
n = Oandn = 1), also for n = 3, and it could be checked 
numerically, within the limits of accuracy of his calculations, 
for some other values of n. Well could he entitle his paper 
“Remarques sur un beau rapport entre les séries de putssances tant 
directes que réciproques” (Eu.1-15.70-90 = E 352|1749). For 
good measure he threw in the functional equation for the 
function 


I | I 
L =l]-~—+---xz4 etc, 
(s) 3° 5S 7 € c 
arrived at in the same way. “This last conjecture”, he wrote, 
“is expressed by a simpler formula than the preceding one; 
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as it is equally certain, one may hope that efforts to find a 
perfect proof for it will be more successful; this will not fail 
to throw much light on a number of similar investigations” 
(“Cette derniére conjecture renferme une expression plus simple 
que la précédente; donc, pursqu’elle est également certaine, il y a 
a espérer qu'on travaillera avec plus de succés a en chercher une 
démonstration parfaite, qui ne manquera pas de répandre beaucoup 
de lumiére sur quantité d’autres recherches de cette nature”: Eu.1- 
15.90]. 

Euler’s paper, written in 1749, appeared in 1768 in the 
Memoirs of the Berlin Academy; it was never reprinted. 
Some of its conclusions, in the far less suggestive form in 
which they had been stated in Euler’s memoir of 1739 
(E 130, published only in 1750 in the Petersburg Commentarit) 
were repeated in 1773 under a less than promising title 
(‘Exercitationes Analyticae”: Eu.\-15.131-167 = E 4321772). 
For a hundred years after their discovery, Euler’s functional 
equations were utterly forgotten. Even in 1849, when both 
Schlémilch and Malmstén did mention the equation for L(s) 
(cf. Eu.I-16°, pp.LXXXIV-~LXXXV, note 2, and A. 
Weil, Coll.Papers I11.293), they clearly regarded it as a mere 
curiosity. In 1859 Riemann resurrected the subject; its fur- 
ther history does not concern us here. 


SXXI. 


In September 1740, Euler, who was already contemplating 
his move to Berlin (cf. Corr.11.461|1740), received from his 
future colleague Philippe Naudé a letter which raised a 
number of mathematical questions (Eu.1V A-1, no. 1903/29 
Aug.1740). 

Naudé, a mathematician of protestant French origin, was 
at that time active in Berlin, where he died in 1745, aged 
only 61 years. As natural at the time, his letter to Euler 
asked first of all for details about Euler’s summation of the 
series ((2n). Another question concerned an equation of 
Pell’s type. Finally Naudé proposed a combinatorial problem: 
in how many ways can a given number m be written as a 
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sum of (positive) integers, all distinct from one another? 
in how many ways can it be written as a sum of y& (positive) 
integers, distinct or not? what is for instance the answer for 
m = 50, uw = 7? 

Euler answered Naudé on 23 September “new style” 
(PkU.193—205, dated 12 Sept.1740 “old style”). Letters be- 
tween Berlin and Petersburg were then normally no less 
than 10 to 12 days in transit; thus Euler’s answer was prompt 
enough, even though he apologizes for its tardiness (“due 
to bad eyesight”, he writes, “from which I have been suffering 
for some weeks”: PkU.193; cf. Corr.I.102|1740, and above, 
§III). Nevertheless he is not only able to give full details on 
¢(2n), but he can already outline the solution to Naudé’s 
last problems, thereby creating in effect a new branch of 
analysis to which he gave its name, “Partitio Numerorum”. 
Soon he was giving a full exposition of his results in the last 
memoir which he presented personally to the Petersburg 
Academy before leaving for Berlin (Eu.I-2.163-193 = 
E 158|6 April 1741). He was to come back upon the same 
topic in greater detail in Chap.XVI of the Introductio in 
Analysin Infinitorum (Eu.1-8.3 13-338) and in a memoir also 
composed in Berlin (Eu.1-2.254-294 = E 1911750), later 
adding a few more results in a paper written in Petersburg 
(Eu.I-3.131-147 = E 394|1768). 

As he had perceived at once (possibly even before hearing 
from Naudé: cf. PkU.203), the key to such questions lies in 
the consideration of suitable formal power-series in one or 
more indeterminates and in the relation between products 
(finite or infinite) of binomials and their expansions into 
sums or into series; as to the latter aspect, he was also en- 
countering it in his investigations on trigonometric functions 
on the one hand, and on “eulerian products” on the other 
(cf. above, §$XVIII-XIX). Now, in order to solve Naudé’s 
problems, he introduces first of all the infinite product 
IL, (1 +x'z); in modernized notation, his calculation is as fol- 
lows. Put: 


P(x, z) = I] (l+x'z) = S A(x)" = > Nin "5 
1=1 p=0 


m,2=0 
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as he observes, N,,,, 1s the number of ways in which m can 
be expressed as a sum of w distinct integers, i.e. the number 
occurring in Naudé’s first question. One has now 


P(x, z) = (1 +xz)P(x, xz), 
hence, identifying both sides: 
A,(x) = A,(x)x* + Ay i(x)x*. 
By induction on yp, this gives: 
gilt D2 


(L=x)(1—x?) + (L-x") 7 


A, (x) = 
Now put: 


i Qt _ m, 
(Ltxitxe%4-) = S Maur” 


l ad 
= m=) 


G—x—-x)-U—-x) 


z 


M,,,. is the number of ways in which m can be expressed as 
a sum of integers (in any number) belonging to the set 
{1, 2, ..., w}. This gives: 

Aye) = x@#tD2 S" My x",  N 


m=0 


mje Mn -piu+ 1)/2,p* 


On the other hand, Euler also introduces the product: 


fT 
4 
| 
x 
Ss 
{ 
i 
ls 
+ 
cao 
nN 
+ 
ad 
Ny 
+ 


Q(x, 2) 


i=} 2=1 


\\ 


[] Buz = PL Niux"2": 


w=0 m,.=0 


here N,,,,, is the number in Naudé’s second question, i.e. 
the number of ways in which m can be written as a sum of 
# integers, distinct or not. Here one has 

Q(x, xz) = (1—xz)Q(x, z) 
and then, proceeding just as before: 


x 


(1—x)(1—x*) + (1=x")’ 


Nig = M 


mM, je m— pee 


B,(x) = 


Euler 279 


From these facts Euler is able to derive a number of re- 
currence formulas between the coefficients M,,,, and to 
construct extensive tables for these numbers. Another 
noteworthy identity concerning partitions is the following: 
Ila+ i Il, (1-—x” ] 
*) = Fe Wow = Fe Leh 
i=] ini (t~x') Hj, (L-x*™’) 
which, by the same argument as before, connects the number 
of ways an integer can be written as a sum of distinct integers 
with the number of ways of expressing it as a sum of odd 
integers, distinct or not. 

It was of course at once obvious to Euler that his numbers 
M,,,, have a property of “stability” when m increases; more 
precisely, they are independent of yu for w = m, and one 
has 


(1—x')7) = 3) Manx”. 


1 m=) 


, 


t 


Already in 1741 this suggested to him to calculate the product 
Ij, (1—x’). “Let me end this paper”, he writes, “with a 
noteworthy observation, which however I have not yet been 
able to prove with geometric rigor” [“Finem huic dissertationi 
faciat observatio notatu digna, quam quidem rigore geometrico 
demonstrare mihi nondum licuit”: Eu.1-2.191 in E 158{1741]. 
The result, conjectured after Euler had carried out the 
product in question for no less than 51 factors: 


(1—x)(1—x?) + (1-2?! 
=Jl—-x-x4+x 4%! 

— x}% — xB 4 xP? 4 x76 — x89 — xf? + x9) + ete. 
was indeed astonishing; all coefficients in the series are 0 
or +1, and in the exponents Euler recognized at once the 
“pentagonal numbers” $n(3n+ 1) for n = 0, -1, +1, ..., 
—5, +5, —6, suggesting the celebrated formula 


a eed 


d —x') _ »y (- Lyra 1/2 


1 n= —% 


o. 


it 


t 


(Eu.1-2.191-192 in E 158|1741; cf. Corr.11.467—468/28 
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Jan.1741, Eu.I-2.280-281 in E 191|1750 and Eu.1-8.332 in 
Chap.XVI of the Introductio). With its generalizations it was 
to play a central role in the next century in Jacobi’s Fun- 
damenta Nova of 1829 and in all his later work on theta 
functions (cf. Jac.1.234—236, and Eu.I-2.191—192, note 1). 
Of course Euler’s gifts of divination did not extend as far 
as that; to him the formula was no more than a formal 
identity amounting to an arithmetical result. 

Some years later he transformed that formula into a 
statement about the arithmetical function that he denoted 
by fn (cf. above, §VI); it is the sum of the divisors of n, 
including | and n, and is now commonly denoted o,(n); his 
interest in it had lately been aroused by an attempt to deal 
with “amicable numbers” (Eu.I-2.59-61 = E 100|1747, Eu.1- 
2.86—162 = E 152)1747). Taking the logarithmic derivative 
of the rave Il (1 —x'), one gets formally 


flog [| (1-2) = ->= 


a) = - > ([N)x*~; 


aa 


1-~x 


I 

| 
Man 
—— 
Ms 


identifying this with the logarithmic derivative of the series 
in Euler’s identity, one obtains 


(— 1)” Ja - gn(3n+1)) = 0 
provided one puts fm = 0 for all m < 0, and provided one 
replaces f0 by N in that relation if such a term occurs in it, 
i.e. if N is one of the “pentagonal numbers” 3n(3n + 1). Euler 
lost no time in sending this to Goldbach, calling it “eine sehr 
wunderbare Ordnung” (Corr.1.407—408|1 April 1747); very 
soon he was writing it up for publication under the title 
“Découverte d’une loi toute extraordinaire des nombres par rapport 
@ la somme de leurs diviseurs” (Eu.1-2.241-253 = E 175|22 
June 1747). Of its validity he had no doubt: “elle appartient 
a ce genre dont nous pouvons nous assurer de la vérité, sans en 
donner une démonstration parfaite” (“it belongs to the kind of 
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results whose truth is certain even though we cannot give 
a perfect proof for it”: Eu.1-2.242). 

Of course this did not stop him from looking for a “perfect 
proof”, and (of course, one is tempted to say) he found one 
(Corr.1.522—524|1750; Eu.1-2.390-398 = E 2441751). This 
proof, another dazzling display of algebraic virtuosity but 
quite elementary, rests upon the construction of a sequence 
of formal power series Py = H(1 — x"), Pi, Ps, etc., such 
that the following relations hold for all n > 0: 


P,-) _ 1 _ xen} _ a ae 


P, a” "(1 —x")\(1 —x"*1). ve (1 =x" *”), 


The case n = 1 is obtained by a straightforward application, 
to the product II(1 — x”), of the obvious identity 


fon oC 


[] d-a,) = - 2a n(L—a)(1 — a9) + (1a, 1). 


n=l = 


Then, assuming the P; to have been constructed up to P 
we have: 


P,, =l]¢4 x"(L—-x"*) +4 Sx"(1-x"*1) rel _ x") 
v=2 


— x” _ Serena _ x”*!) vee (1 _ xt”), 
p=l 


Replacing v by » + 2 in the former sum, by v + I in the 
latter, and then combining the two, one finds indeed that 
this is nothing else than 1 — x°"*! — x°"**P.,,, so that one 
can proceed by induction on n. This gives: 
Py = 1 — x — x°(1—x*) + x? P°(1—x°) - 
+ (- Lyix? FoF Gnd P 


for all n > 0, which completes the proof. 
With another brilliant idea, arisen in the same context, 
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Euler was less successful. Having come to regard Fermat’s 
famous assertion about “three triangular numbers, four 
squares, five pentagonal numbers, etc.” (cf. above, §V) as 
belonging by rights to the subject of “partitio numerorum”, 
he naturally thought of applying to it the method he had 
introduced for such purposes, viz., the method of formal 
power-series (Corr.1.531—-532!/1750; cf. Eu.1-3.132, 144-145 
in E 394|1768). For instance, as he observes, the statement 
about four squares amounts to saying that if one writes s 
for the formal power-series 


s=lLtxi txt +x + etc. 


all powers of x occur in the series s* with non-zero coefficients; 
well he knew, of course, that the coefficient of x” in s* would 
indicate in how many ways n can be written as a sum of 
four squares. “This”, he writes to Goldbach, “seems to me 
the most natural way of attaining to the proof of Fermat’s 
theorems” [“Dieser Weg déucht mir noch der natiirlichste zu seyn, 
um zum Beweis der theorematum Fermatianorum zu gelangen”: 
Corr.1.532|1750]. Except for the substitution of the “theta 
series” 


9-1= > x” 


n=-x 


for s, this is precisely how Jacobi proved the theorem in 
question as a worthy conclusion to his Fundamenta, thus 
fulfilling Euler’s prediction. Nor was Euler content with the 
mere consideration of s as a formal series; he would not 
have been himself if he had not made it an occasion for 
numerical computations; what attracted him was its behavior 
as x approaches —1, or, what amounts to the same, the 
behavior of the series 


t= 1—xt + x? — x!© 4+ etc. 


nearx = | (Corr.1.529—531{1750). Here, substituting at first 
1 — y for x, he obtains formally 


> (= 1" -y)” = 4 + Oy + Oy? + Oy? + ete., 


n=O 


Euler 283 


this being an easy consequence of his earlier results according 
to which 1” — 2" + 3” — etc. is 0 for even m > 0 and has 
the value 3 for m = 0 (cf. above, §XX). It suggests that, for 
x approaching I, 2¢ — | tends to 0 faster than any power 
of | — x, an expectation which Euler seeks to confirm by 
computing ¢ with 10 decimal figures, successively for x = 
0, 2, 3, 4, 75 (no mean feat, in view of the slow convergence 
of the series ¢ for x close to 1). “But I have vainly looked 
for a sure way of summing this series numerically for x = 
1 — w when w is very small”, he writes to Goldbach; “a 
method for doing that would be of high value” (“es wére 
eine Methode hoch zu schédtzen, vermittelst welcher man im Stande 
ware den Werth von t proxime zu bestimmen, wenn w ein sehr 
kleiner Bruch ist”: Corr.1.531]. Here again one must admire 
Euler’s sureness of judgment; the answer to his question is 
contained in the functional equation of the theta series; 
this, too, was given to the mathematical world by Jacobi in 
his Fundamenta. 


SX XII. 


Euler’s arithmetical work, although it extends over some 
fifty years, makes up only a small part of his immense pro- 
duction. Nevertheless some of the main characteristics of 
his scientific temperament emerge clearly from the above 
survey. 

Perhaps its most salient feature is the extraordinary 
promptness with which he always reacted even to casual 
suggestions or stimuli: a question from Goldbach about the 
primality of Fermat’s numbers 2°" + 1 (cf. above, §§IV, Va 
and VI); Fermat’s statement on sums of four squares, which 
at first he did not even read through (§§IV, V6, XI); an 
elementary combinatorial problem proposed by Naudé 
(§§Vh, XXI); a few isolated theorems in a book by a little- 
known Italian nobleman (§§Ve, XV); a theorem by Lagrange 
on congruences modulo a prime (§VJ); Lagrange’s somewhat 
less than felicitous proof for Fermat’s theorem on sums of 
four squares (§X1); etc. Nor was he neglecting chance ob- 
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servations such as his calculation of II(1—x") (cf. §XXI); 
“such observations are all the more valuable”, he had written 
after receiving Fagnano’s volume (loc.cit. XV). Every oc- 
casion was promptly grasped; each one supplied grist to 
his mill, often giving rise to a long series of impressive 
investigations. 

Hardly less striking is the fact that Euler never abandoned 
a problem after it had once aroused his insatiable curiosity. 
Other mathematicians, Hilbert for instance, have had their 
lives neatly divided into periods, each one devoted to a 
separate topic. Not so Euler. All his life, even after the loss 
of his eyesight, he seems to have carried in his head the 
whole of the mathematics of his day, both pure and applied. 
Once he had taken up a question, not only did he come 
back to it again and again, little caring if at times he was 
merely repeating himself, but also he loved to cast his net 
wider and wider with never failing enthusiasm, always ex- 
pecting to uncover more and more mysteries, more and 
more “herrliche proprietates” lurching just around the next 
corner. Nor did it greatly matter to him whether he or 
another made the discovery. “Penitus obstuput”, he writes (“I 
was quite flabbergasted”: Fu.I-21.1 in E 506|1777; cf. his 
last letter to Lagrange, Eu.I1V A-5.505|1775) on learning of 
Lagrange’s additions to his own work on elliptic integrals; 
after which he proceeds to improve upon Lagrange’s 
achievement. Even when a problem seemed to have been 
solved to his own satisfaction (as happened with his first 
proof of Fermat’s theorem a’ = a mod #, or in 1749 with 
sums of two squares) he never rested in his search for better 
proofs, “more natural” (Eu.1-2.510 in E 2621755; cf. §VI), 
“easy” (Eu.1-3.504 in E 552|1772; cf. §VI), “direct” (Eu.1- 
2.365 in E 242|1751; cf. §VI); and repeatedly he found 
them. 

Some problems eluded him all his life. Most conspicuous 
among these perhaps, so far as number theory is concerned, 
are his conjectures related to the law of quadratic reciprocity, 
conjectures first formulated in 1742 and soon thereafter, 
and whose importance he was still emphasizing at the end 
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of his life (cf. 88V2z, VIII, IX); by that time he could scarcely 
hope to obtain a proof himself but was expressing his con- 
fident expectation that their proof would be “forthcoming 
soon” (“quam ... mox expectare licebit”: Eu.J-4.191 in 
E 598|1775). How pleased he would have been to see even 
the partial proof given by Legendre in 1785, not to mention 
the full proof found by Gauss in 1796 and published in 
1801! It was not given to him to reach the promised land; 
but his eyes, now turned inward, had been allowed a glimpse 
into its splendor, or, to use his own word, its Herrlichkeit. 


Appendix I 


The Quadratic Reciprocity Law 


As explained above in §VIII (cf. Chap.II, §XII), a prime 
p, assumed to be odd and not a divisor of N, is in Euler’s 
sense a “prime divisor of the form X* + NY?” if it divides 
some integer a? + Nb? with a prime to Nb; this is equivalent 
to saying, either that —N is a quadratic residue modulo p, 
or alternatively that p splits into two prime ideal factors in 
the field Q(V —N). Our purpose here is to describe the facts 
about those primes in the form best suited for comparison 
with Euler’s empirical results. 

Without restricting the generality we shall assume that 
N is squarefree and neither —1 nor 0; we shall put d = 
—-Nord = —4N according as N = — | (mod 4) or not, and 
D = |d|. Then there is a function w of integers with the 
following properties: 

(1) a(n) = +1 af nes prime to D; otherwise w(n) = 0. 

(II) w ts periodic of period D, i.e. it satisfies 


a(n) = w(n+D) 


for all n; this can also be expressed by saying that w is a 
function of integers modulo D. 

(IIL) w(mn) = w(m)w(n) for all m and n. This is expressed 
by saying that w is a “Dirichlet character” modulo D. In 
particular, w(n*) = 1 for all n prime to D. 

(IV) w(— 1) 2 +1 or —1 according as N ts <0 or >0. 

(V) There is no divisor D' of D other than D such that w(m) = 
w(n) whenever m, n are prime to D and m = n (mod D’). This 
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is expressed by saying that the character w is primitive modulo 
D; in view of (I) and (III), it amounts to saying that there 
is no divisor D' of D, other than D, such that w(n) = 1 
whenever n is prime to D and =1 (mod D’). 

Finally, the relevance of w to the question of the prime 
divisors of X? + NY? is given by the following: 

(VI) An odd prime p is a prime divisor of X* + NY? if and 
only if wp) = +1. 

In view of (1), (11) and (I1]), w determines a character of 
order 2 (in the sense now usual in group-theory) on the 
multiplicative group Gp = (Z/DZ)”* of integers prime to D 
modulo D; thus its kernel, i.e. the set of congruence-classes 
modulo D on which it takes the value 1 is a subgroup of Gp 
of index 2; according to (II1) this kernel contains the group 
(Gp)* of quadratic residues modulo D. 

It will now be shown that conditions (I) to (V) determine 
such a function w and determine it uniquely. Take first the 
case when D is a prime; this will be so whenever N= +q,’ 
where g is an odd prime and the sign is determined by 
N = —1 (mod 4). In that case, as Euler had found out not 
later than 1751 (Ew.I-2.339-355 in E 242|1751; cf. nos. 284— 
307 of the Tractatus, Eu.1-5.227—230, and above, §VI), 
quadratic residues make up a subgroup of index 2 of 
Gp = G,. Therefore w(n) is + 1 or —1 according as n is a 
quadratic residue or non-residue modulo q; in the notation 
in use since Legendre (cf. Chap.IV, §V1), this is written 


* , or more compactly (n/q). That this satisfies (IV) is 


nothing else than the determination obtained by Euler in 
1749 of the quadratic residue character (~— 1/g) of — 1 mod- 
ulo g (cf. above, §VI, and Chap.II, §VIJ). 

In the general case, write D = qo91q2 ... g,, where go is 1, 
4, or 8, and q:, qe, -.., q, are distinct odd primes. As Euler 
knew, at least from the time when he wrote the Tractatus, 
an integer is a quadratic residue modulo D if and only if it 
is such modulo each one of the q;. In modern terms, Gp/ (Gp) 
is a group of type (2, 2, ..., 2) with 2’, 2'*! or 27+? elements 
according as go is 1, 4 or 8; each one of its characters can 
be written as the product of some or all of the characters 
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(n/q;) for? = 1, 2, ..., 7, and, if go = 4 or 8, of a character 
modulo qo; a primitive one such as @ is then necessarily 
given, for x prime to D, by a product 


o(n) = wn) TT (2) 
i=) \Gi 

where W = 1 ifgo = 1, and otherwise wis a primitive character 
modulo qo. If gg = 4 there is only one such character Wo, 
given by (1) = 1, Wo(—1) = —1. If go = 8 we have Gg = 
{+ 1, +3 mod 8}; a primitive character of Gg must satisfy 
W(—3) = — 1 since otherwise y(n) would depend only upon 
the value of n modulo 4; thus there are two such characters 
1, Wo, respectively determined by w,(—1) = 1, Wo(—1) = 
—1.Ifq = 8, the character W occurring in the above formula 
for w is therefore the one determined by condition (IV), 
i.e. the one given by 


W(-1) = sgn(-N) [I] (=) 
i=1 a 
where sgn(—N) is | or — 1 according as —N is >0 or <0. 
The fact that condition (IV) is satisfied also for gg = 1 or 
4, when w» is determined as above, is again an easy conse- 
quence of the determination of (—1/q) for any odd 
prime q. 
As to condition (VJ), which can now be written as 


(s*) 0 


(this being valid for all odd primes #, not dividing N), take 
first the case N = 1. Then D = 4; w is the character Wo 
modulo 4; and (VI) says that (— 1/p) = o(p), which is indeed 
the case. Take N = +2; then D = 8; wisw, if N = —2, te 
if N = +2; then (VI) says that (2/p) = Wi(p), (—2/p) = 
Wo(p), as was found to be the case in §1X. 

Take now N = +g, the sign being determined by N = 
— 1 (mod 4), so that D = qg. Then (VI) can be written as 


(3) - () 
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where p, g are any two distinct odd primes, and the sign is 
such that ¥g = | (mod 4). In view of the previous deter- 
mination of (— 1/p), this is tantamount to the law of quadratic 
reciprocity’, as first formulated by Legendre in 1785 (cf. 
infra, Chap.1V, §V1). Thus the above properties (1) to (V1) 
of w may be said to include that law. Now, assuming its 
validity, we will show that condition (VJ) is satisfied for all 
values of N. 

Put again D = qoq; ... g, aS before, and, for7z = 1, 2, ..., 
7, putg; = +4q;, the sign being determined byg; = | (mod 4). 
We can write —N = ogi... g¢ with qa = lor +2. Then 
we have: 


(S) = (A) (8) - G) EC) 


As noted above, we have (qo/p) = W'(p), where W’ 1s 1, Wo, 
Ws, Or P. according as q is +1, —1, +2 or —2; the proof 
will be complete if we show that w’ is the same as the character 
w occurring in the definition of w. This is so for gg = 1, 
since in that case we have —N = 1 (mod 4), D = |N|, qo = 1, 
w = 1. Similarly, if gg = —1, we have N = 1 (mod 4), 
D = A\N\, qo = 4,6 = wo. Now take the case gj = +2; then 
D = 4|N|, qo = 8, & = Wy or We, and we have to show that 
w(— 1) = w&'(—1). But in that case we have 


—\ 7~{-1 
sgn(qi) = (=), sgn(—N) = sgn(qo) [] (=) 


F i=l \ Gi 


and therefore 
w(— 1) = sgn(go) = #'(- 1), 


which completes our proof. 


' The most lucid exposition of this topic is the one contained in Chap.IIL 
of Dirichlet’s Vorlesungen tiber Zahlentheorie (ed. R. Dedekind, Braunschweig 
1894). For perhaps the simplest known proof of the “law” in question, 
which is Eisenstein’s, cf. A. Weil, Number Theory for Beginners, Springer 
1979, §XII. Cf. also J.-P. Serre, Cours d’Arithmétique, P.U.F., Paris 1970, 
Chap.I. 
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Finally, in order to facilitate the comparison of the above 
results with Euler’s conjectures (cf. §VIID), it is convenient 
to introduce the kernel K, of the character induced by w 
on the multiplicative group modulo 4{|M|, i.e. the set of the 
congruence classes prime to 4|N| modulo 4|N| on which @ 
takes the value 1. Clearly this has the following properties: 


(A) Ky ts asubgroup of index 2 of the multiplicative group modulo 
4\N\. 

(B) —1 belongs to Ky or not, according asN <0 orN > 0. 

(C) If rand s are prime to 4|N|, and if r = s (mod D), then both 
belong to Ky whenever one of them does, and no divisor D' 
of D, other than D, has the same property. 


Conversely, let Ky be a subgroup of index 2 of the mul- 
tiplicative group modulo 4|N\, satisfying conditions (B) and 
(C). Let w' be the function of integers modulo 4|N| such 
that w’(n) = Oif nis not prime to 4|N|, @'(n) = Lifn belongs 
to Ky, and w'(n) = —1 if nis prime to 4|N| and not in Ky. 
For N = 1 or 2 (mod 4), @’ satisfies all the conditions (I) to 
(V) listed above, so that it is no other than w. Now take N 
= —1 (mod 4), so that D = |N\. Define a function w” of 
integers by putting w"(n) = 0 when n is not prime to N, 
and w"(n) = w'(r) when n is prime to N, ris prime to 4|N|, 
and n = r (mod |N)); in view of (C), this is independent of 
the choice of 7, so that w” is well defined; it is easy to see 
that w” satisfies all the conditions (I) to (V), so that it is the 
same as w. This shows that w' in the former case, and w” in 
the latter case, satisfies condition (V1). Thus an odd prime, 
not a divisor of N, is a divisor of the form X* + NY? if and 
only if it belongs to Ky modulo 4|Nj, which is what Euler 
had conjectured in 1744. 


Appendix II 


An Elementary Proof For Sums of Squares 


In 1751, Euler, having proved that every integer is a sum 
of (at most) four squares “in fractis”, i.e. in rational numbers, 
wrote as follows: 

“In Analyst quidem Diophantea pro certo assumi solet nullum 
numerum integrum in quatuor quadrata fracta dispertiri posse, 
nisi etus resolutio in quatuor quadrata integra vel pauciora constet 
... Verum nusquam adhuc evusmodi demonstrationem invent. . .” 
(“In diophantine analysis one usually takes for granted that 
no integer can be split into four rational squares unless it 
has an expression as a sum of four integral squares or less 
... but so far I have nowhere found a proof of this. . .”: Eu.1- 
2.372 in E242 | 1751). This is indeed a question which must 
occur naturally, concerning sums of 2, 3 or 4 squares, to 
any reader of Diophantus, and Fermat had already given 
it some thought, apparently without success, at the beginning 
of his career as a number-theorist (cf. Chap.I1, §V). Euler 
himself had raised it, more particularly concerning sums 
of two squares, in 1745 (cf. Corr.I.312), and again repeatedly, 
concerning sums of four squares, in his correspondence 
with Goldbach (cf. Corr.1.521|1750, 527|1750, 559|1751). 

Here we shall reproduce a proof due to L. Aubry (Sphinx- 
CEdipe 7 (1912), pp.81—84) which applies equally well to 
sums of 2, 3, or 4 squares and a few other quadratic forms. 
We first describe it in geometric terms for sums of 3 squares. 
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Points in R° will be called rational (resp. integral) if they 
have rational (resp. integral) coordinates. To each point 
a = (x, y, z) there is an integral point a’ = (x’, y’, z') ata 
euclidean distance less than 1 from a; for instance one can 
take for x’, y’, z' the integers respectively closest to x, to y 
and to z, in which case the distance from a to a’ is <= V3/2. 

Now assume that an integer N is a sum of three rational 
squares; this is the same as to say that there is a rational 
point a» on the sphere S given by N = x° + y? + 2?, Let 
ay be the integral point (or one of the integral points, if 
there are two or more) closest to ag; its distance from dp is 
<1. Call a, the second intersection of $ with the straight 
line joining dp to ag; it is rational. Let a; be an integral point 
closest to a,; let a) be the intersection of S with the line 
joining a, to a}; etc. It will now be shown that, for some n, 
a, is an integral point. 

Put a = (x, y, z); assume that it is not an integral point, 
and call m the lowest common denominator for x, y, z; write 
x = n/m,y = p/m, z = q/m; we have 


Nm® =n + p + @. 


{I 


Let ag 
put 


(n', p', q') be an integral point closest to aj, and 


r=n-— mn’, sS=p-—mfp’', t=q—- mq, 

N =n? +p? +4",  M = 2(nn'+pp'+qq’). 
The squared distance between ay and aj is then 

ee M 

(r++?) =N+N -— 

m m 


which can be written as m'/m, where m' is an integer; as it 
is <1, we have 0 < m’ < m, and at the same time: 


r++ =mm', M =mN+N’)—m’. 
Now the line joining a» to ag consists of the points 


(n’+dr, p' +s, g' +b); 


294 Number Theory Chap.II, App. 


here the point a, is given by 
0 = (n'+Ar)? + (p' tds? + (g' +AO> -— N 
= mm'd* + (M-2mN')\ + N’- N 


li 


(md — 1)(m'A+N—N’). 


The root A = 1/m corresponds to the point ao, so that the 
other root \ = (N'—N)/m' corresponds to a,. Thus m’ is a 
common denominator for the coordinates of a,; as itis <m, 
this shows that the lowest common denominators for the 
coordinates of ap, @;, dg, etc. make up a decreasing sequence 
of positive integers, which proves our assertion. 

The only property of the quadratic form 


F(X,Y,Z) =X? + ¥24+ 2? 


which has been used in this proof is that, to every non- 
integral point (x, y, z), there is an integral point (x’, y’, 2’) 
such that 


0< |F(x—x',y-y',z-2')) <1. 


This applies equally well, for instance, to the forms X Py yy? 
X? + 2y?, x? — 3Y*. Now, modifying the notation in an 
obvious manner, we will show how the same proof can 
be applied to some quadratic forms F(x) in R", taking integral 
values at all integral vectors x and such that, to every non- 
integral vector x in R", there is an integral vector x’ for 
which 0 < |F(x—x’)| < 1; this will be the case for instance 
for X? + 3Y°, X* + Y? + 2277. xX? + ¥? + Z° + T*. Define 
the bilinear form B(x, y) by 


F(Ax+ py) = NE(x) + AwB(x, y) + MF (9). 


As before, take a rational point a) such that F(a9) = N, and 
an integral point ag such that 


0 < |F(a)—)| < 1. 


Let m be the smallest integer such that n = may is an integral 
point; put aj = n’,r =n ~ mn',N’ = F(n’), M = B(n,n’). 
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We have 


F(ag— ao) = F(m™'r) = F(m7'n-n') = N +N’ - ~ 
writing this as m’/m, we have 0 < m'’ < m, and m’ is an 
integer. 

The line joining ag to aj consists of the points n’ + Ar, 
and its intersection a, with the hypersurface F(x) = N is 
given by 


0 = F(n'+Ar) — N = mm'd? + (M—2mN')\ + N'-N 
= (md—1)(m'h+N-N’). 


As before, a; corresponds to the root X = (N’—N)/m’', so 
that m'a, is an integral point, and m’ is a common denom- 
inator for the coordinates of a,; however, we have now to 
take into account the case {m'| = m, i.e. |F(m™'n)| = 1. Taking 
for F one of the three forms listed above, we may assume 
that we have taken for ao the point, or one of the points, 
whose coordinates are the integers closest to those of a9, so 
that the coordinates of m~'r are <} in absolute value; then, 
for F(m~'r) to be 1, they must all be +4, and 2a, must be 
integral, so that m = 2. In that case there are 2” possible 
choices for aj, and this choice can be made so that N — N’ 
is even; since at the same time m’ must be | or 2, a; is then 
integral. 

The above proof would have been easily understood by 
Euler; perhaps, with a little more effort, it would have been 
understood by Fermat, whose algebraic skills still fell some- 
what short of the required level. That it was discovered so 
late may serve as an encouragement to those who seek el- 
ementary proofs for supposedly sophisticated results. 


Appendix III 


The Addition Theorem For Elliptic Curves 


Our purpose here will be to provide an appropriate back- 
ground for Euler’s theory of elliptic functions. As in Chapter 
II, Appendix II, let FP be an algebraic curve defined over a 
groundfield of characteristic 0. By a differential on T one 
understands any expression w = {- dg, where f, g are rational 
functions on I’; this may be identified with f(dg/dx)dx, where 
one can take for x any non-constant rational function on 
r. 

To every point M on one can attach a rational function 
x on P having a simple zero at M; one can then use it as a 
local coordinate at M, and every function f on I’ can be 
expressed in a neighborhood of M as a Laurent series 
f= 3h, ax’; if a, # 0, nis the order of fat M; it is >0 if f 
has at M a zero of multiplicity n, and it is <0 if f has at M 
a pole of multiplicity —n. We recall that the number of 
zeros of a non-constant function f on Tis equal to the number 
of its poles, both being counted with their multiplicities; if 
this number is N, fis said to be of order N, and the field of 
all rational functions on I is an algebraic extension of degree 
N of the field K(f) of rational functions of f, K being the 
groundfield. 

Let M and x be as above; then every differential w on 
I can be expressed in the neighborhood of M as w = 
x, a;x'dx; it is said to have a zero of multiplicity n at M 
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if a, # 0, n > 0, and a pole of multiplicity —n at M if 
a, # 0,n <0. It is said to be exact at M if it can be written 
there as d(27_,, b;x’); this is so if and only if a_, = 0. A 
differential is said to be of the first kind if it has no pole, 
of the second kind if it is exact everywhere; otherwise it is 
said to be of the third kind. 

From now on we shall take the groundfield to be the field 
C of complex numbers. Then differentials can be integrated 
along any (piecewise differentiable) path on I’; such an in- 
tegral, taken along a closed path, is called a period. 

Let first be of genus 0; then there is on [ a rational 
function x of order 1; rational functions on I are those of 
the form R(x) = P(x)/Q(x), where P, Q are polynomials; T 
can be identified with the “projective straight line” (over 
C), also known as “the Riemann sphere”, i.e. the plane of 
the complex variable x to which is added one point at infinity, 
written x = ©. OnT there is no differential of the first kind; 
ifw = R(x)dx is any differential on I, then (as was in substance 
proved by Euler; cf. above, §§VII and XV) its integral fa, 
which is well-defined up to an additive constant, can always 
be expressed in terms of rational and logarithmic functions 
(rational, logarithmic and inverse trigonometric functions 
if the groundfield is the field of real numbers, as is always 
understood by Euler). This integral is itself a rational func- 
tion, i.e. one can write w = dS(x) with a rational function 
S, if and only if @ is of the second kind. 

Next, let be any algebraic curve; let >y A;, >¥ B, be two 
equivalent divisors on I (cf. Chap.II, Appendix IT); we as- 
sume that A; # B; for all 2, 7. This means that there is on T 
a function x of order N, with the N zeros A; and the N poles 
B; (both being counted with their multiplicities). Then IP 
can be identified with an N-sheeted Riemann surface spread 
over the “Riemann sphere” for x, and the field of rational 
functions on I can be written as C(x, y), where y is a function 
on I, algebraic of degree N over C(x). Take the irreducible 
equation for y over C(x): 


P(x, ¥) = S Fi@)Y*" = Fox) [[ (¥-y,) = 9, 
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where the F; are polynomials, and y, = y. Let w = R(x, y)dx 
be any differential on I; put 


N 
n = > R(x, y;)dx. 
1 


This can be written as $(x)dx, where S is a rational function; 
it is a differential on the “projective straight line”. It is of 
the first kind, and therefore 0, if w is of the first kind; it is 
of the second kind if w is of the second kind. Now take any 
path \ from x = 0 tox = “in the x-plane; its inverse image 
on I consists of N paths 4; going respectively from A; to B; 
onT (for 1 <i <N)if the points A;, B; have been numbered 
suitably. We have in that case: 


N 
|n ~ 2 oe 


Here the left-hand side, and therefore also the right-hand 
side, must be 0 if w is of the first kind; they can be expressed 
rationally in terms of the data if w is of the second kind; in 
the general case the result is the same except that logarithmic 
terms may be needed. This, in substance, is Abel’s theorem. 
If other paths than the A, are used in the right-hand side, 
its value is merely modified by some period of w. 

The relevant case for Euler’s theory of elliptic integrals 
(and for Fagnano’s earlier investigations on the same subject) 
is that of a curve I given by an equation y’ = F(x), where 
F is a polynomial of degree 3 or 4 without double roots (cf. 
Chap.II, Appendix II); then IT is of genus 1, and x is of 
order 2 on T; up to a constant factor, there is on F one and 
only one differential of the first kind, given by w = dx/y; 
it has no zero. Let A + N,B + M be equivalent divisors on 
lr; then, by Abel’s theorem, we have 


" fre= fo 


or, what amounts to the same 


o — fa-fle= [hs 
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if the paths of integration are suitably chosen; otherwise 
both sides differ by a period of w. If w is replaced by a 
differential w’ of the second or third kind, then, for the 
same reason, the difference of both sides can be expressed, 
up to a period of w’, by rational and logarithmic functions 
in terms of the data; it can be so expressed as a rational 
function if w’ is of the second kind. As to the condition 
A+N~B 4+ M, itcan also be written as N ~ M + B — A, 
or, in the notation explained in Chap.II, Appendix II, as 
N =M + B if A is taken as origin. As follows from the 
general theory which has been sketched there, N can be 
expressed rationally in terms of A, B, M; as will be seen 
presently, explicit formulas for this were obtained by Euler 
and were perhaps his main discovery on the subject of elliptic 
integrals. 

In the classical theory of elliptic functions according to 
Jacobi, Abel and their successors, one takes as independent 
variable the value of the integral 


M 
u=| oo 
A 


along any path with the origin A and the end-point M on 
I; for a given M this is defined modulo a period of w; one 
proves that the periods of w make up a lattice L in the 
complex plane C and also that M is uniquely determined 
by the value of u modulo L; thus the mapping u > M 
determines an isomorphism of the group C/L onto the group 
of the points on F under the operation +. “Elliptic functions” 
are then the rational functions on I, viewed as functions 
(meromorphic functions, as their definition implies) of the 
independent variable u. The algebraic formulas which ex- 
press M + M’ in terms of M and M’ (resp. nM = M + + 
+ M in terms of M) can then be viewed as addition (resp. 
multiplication) formulas for the elliptic functions belonging 
to I’. This point of view, which has dominated the theory 
during the nineteenth and part of the twentieth century, 
is irrelevant to Euler’s work on the subject. 

Take again N = M + B; put M = (x, y), N = (x', 9’); 


300 Number Theory Chap.lil, App. III 


taking A, B to be constant, and differentiating (1), we get 
dx — dx’ 


yoy 

this can be expressed by saying that, for given A and B, the 
mapping M > N = M + B isan automorphism of I trans- 
forming the differential w into itself. Clearly such auto- 
morphisms make up a group &, isomorphic to the group 
of the points of I under the operation +; this group is 
simply transitive, i.e. there is one and only one such auto- 
morphism mapping a given point P onto a given point Q, 
viz., the one given by B ~ A + Q - P. 

More generally, let a be any analytic mapping of a neigh- 
borhood of a point P of [ onto a neighborhood of a point 
Q; assume that it maps P onto Q and transforms @ into w. 
Combining it with an automorphism M > M + B mapping 
Q onto P, we get a mapping a’ of a neighborhood of P onto 
a neighborhood of P, leaving both P and @ invariant. Let 
tbe a function on [ with a simple zero at P; taking it as the 
local coordinate at P, we can write w = D%, a;t' 'dt, with 
a, # O since w has no zero. This can also be written as @ = 
du with wu = YjL,a;t'/i; expressing the mapping a’ in 
terms of the local coordinate u, we see that it must be the 
identity, since it leaves both du and the point u = 0 invariant. 
This shows that a mapping such as a must coincide every- 
where locally with a mapping M > M + B. Using either 
analytic continuation or the principle of conservation of 
algebraic identities, one concludes from this that every au- 
tomorphism of I’, transforming w into itself, must belong 
to the group G defined above. 

Define a function f on I by putting f(M) = x whenever 
M = (x,y); also, for M = (x, y), write M7 = (x, —y), so that 
f(M) = f(M~), and that M > Mis an automorphism of 
I, changing » into —w. If P is any point on I’, other than 
A and A, the function 


_ fl) - flP) 
e(M) = Fa) — fla) 
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on [has the zeros P, P~ and the poles A, A, so that we 
have P + P” ~ A + A’ for all P. Now take again N = 
M + B,ie.N~M + B — A, and put M = (x,y), N = 
(x’, y') as before. As N depends rationally upon M, B and 
A, we can express x’ = f(N) as R(x) + yS(x), where R, S are 
rational functions of x, with coefficients which depend ra- 
tionally upon A and B. Call ®(x, x’) = 0 the irreducible 
relation between x = f(M) and x’ = f(N); it must be of 
degree 2 or | in x’. Moreover, we have 


N7~A+A>—-N~A+A™ -(M+B-A)~B° +A-M; 


therefore the mapping M — N” is an automorphism of I, 
exchanging M and N and therefore exchanging x = f(M) 
and x’ = f(N). Consequently the relation P(x, x’) = 0, for 
given A and B, must be symmetric in x and x’, so that it can 
be written in Euler’s “canonical form” (Eu.1-20.71 in 
E 251|1753; cf. above, §XV): 


P(x, x’) = a + 2B(x+x') + (x? +x") 
+ Qxx' + Qexx'(xtx') + Gx?x’”? = 0, 


where the coefficients depend rationally upon A and B. It 
is easy to see that this cannot be of degree 1 in x’, nor 
therefore in x, except for some special values of B (more 
precisely, for 2B ~ 2A). 

If this is written as 


Po(x)x’? + 2P\(x)x’ + Po(x) = 0, 


where Po, P;, Pz are polynomials of degree 2, then, since 
it has a root x’ of the form R(x) + yS(x), the polynomial 
Pi — PoP», must be of the form p*F, with p depending 
rationally upon A and B but not upon M. Conversely, assume 
that this is so. We have then: 


5 O0 = Polx)s! + Pix) = £VPIQ)" — PaPae) = =p VF). 


2 ax’ 


Since ® is symmetric, we have also 
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Therefore, differentiating the relation P(x, x’) = 0, we get 


which can also be written as 


dx _ dx’ 
yoo) 


if the signs of y, y' are chosen properly. In Euler’s language, 
this is expressed by saying that the relation ® = 0 is an 
integral of the above differential equation; it is a “complete” 
(i.e. general) integral, since it contains an arbitrary constant 
(viz., f(B); A is taken as given once for all). 

A noteworthy special case occurs when F is of the form 


F(x) = 1 + mx® + nx’; 


as shown by Euler and Legendre, the general case can always 
be reduced to this one by a suitable substitution x — 
(Ax + p)/(vx + p) (cf. above, §XV). For every M = (x, y), put 
M* = (—x, —y); as M — M* is an automorphism of I, 
transforming w into itself, it must be of the form M — 
M + Bo, with some B, on; as it is of order 2, we must have 
By + Bo = 0, i.e. 2By ~ 2A. As before, take N = M + B, 
x = f(M),x’ = f(N), so that we have P(x, x’) = 0. The auto- 
morphism M — M* maps N onto N* = M* + B; as we have 
f(M*) = —x, f(N*) = —x’, we must have O(—x, —x’) = 0. 
Therefore ® must be of the form 


D(x, x") = aw + y(x7?+x"%) + Qdxx’ + Cx?x”? 


as postulated by Euler (cf. above, §XV). 

As the point A can be arbitrarily chosen, we may assume 
that we have taken A = (0, 1). Put B = (a, b); then the 
relation ® = 0 must be satisfied for x = f(A) = 0, x’ = 
f(B) = a; this gives a = ~—vya*. Also, if Po, Pi, Py are 
again the coefficients of x’?, 2x’, | in ®, we must have 
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Pt — PoP, = p’F; this gives 
ay = —p’, ty = —np’, 8 = y? + al + mp” 
(cf. Eu.1-20.158—161 in E 261|1755). Since a, y, 6, , p are 


defined only up to a common factor, we may assume that 
y = —1. This gives: 


a=@=p, C=nd?, S=1tma’+ n=’, 
O(x, x’) = a? — x? — x? + Qbxx" + na®x?x'? 
as Euler had found (cf. above, §XV). Solving P(x, x’) = 0 
for x’, we get: 


dx + ay 


ro 
2,,27 


= — 5 6= +b 
1 — na*x 


and a similar formula for x in terms of x’, y’, since ® is 
symmetric. The double signs can be determined by observing 
that N depends symmetrically upon M and B, and that it is 
B when one takes M = A. This gives 6 = b and 


, ox tay 
(3) TH na*x?* 

Similarly, writing now the formula for x in terms of x’, 9’, 
and observing again that this must hold for M = A, N = 
B, we get 

_ bx’ — ay’ 

(4) ~ 1 _ nazx’* 

which of course can also be used for expressing y’ in terms 
of x, y, a, b. These are the addition and subtraction formulas 
obtained by Euler (Eu. 1-20.69 in E 251|1753, etc.; cf. above, 
§XV). From this, one can derive multiplication formulas 
in the usual manner. For instance, for My = M + M, i.e. 
M, ~ 2M — A, we have only to put B = M in the addition 
formula, and we get 


2xy 


Xo = f(M2) = lL — nx? 
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Similarly, defining M, by M, = M + M,_, for v > 2, or, 
what amounts to the same, by 


M, ~~ vM — (v-)A 


for all v 2 2, and applying the addition formula repeatedly, 
one obtains the coordinates of M, in terms of x, y for all v. 
One has then, by Abel’s theorem 


My M 
[ oO = »| @ 
A A 


if the paths of integration are taken suitably; otherwise both 
sides differ by a period of . 

All this can be illustrated by considering the “lemniscatic 
case” y° = 1 ~ x*, which played a decisive role in the work 
of Fagnano and in the early work of Euler on this subject 
(cf. above, §XV). Following these authors, we take now as 
groundfield the field of real numbers; all square roots are 
to be taken positively. The lemniscate is the curve 


(x? +7)? — x _ y; 


for convenience we consider only that part of the curve 
which lies in the first quadrant x = 0, y = 0; it is an arc of 
origin 0 = (0, 0) and end-point P = (1, 0). This arc can be 
represented parametrically as 


x= Viz), y= VERP=24) = O02 <1; 


the arc-length on it is given by 


d 
ds = Vie a = 


(cf. Eu.1-20.91 in E 252/1752). Put ¢ = V1 — 24; (x, 9) > 
(z, t) is a One-to-one correspondence between the arc in 
question on the lemniscate and the arc0O <z< 1,f20on 
the curve T given by ¢? = 1 — z*. All integrals on T will be 
taken along segments of that arc. 

The addition and subtraction formulas on I are now given 
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by 

_ viet Wie 

7 1+ 02? 

(Eu.1-20.65 in E 251|1753). Thus, for any given ¢, the map- 


ping z — z’ transforms the arc-length ds into itself. For 
¢ = 1, the subtraction formula gives the mapping 


t 


14+ 2? 


t>2z,= 


which maps O and P onto one another, the arc OP onto 
itself, and preserves the arc-length. For ¢ = z, we get the 


duplication mapping 
2zV 1 — z' 


o> fa = 1+ 2’ 


which changes ds into 2ds. These two mappings were Fa- 
gnano’s discoveries (Fag.11.294,308; cf. Eu.I-20.92—95 in 
E 2521752); the addition formula was first obtained by Euler 
(Eu.1-20.63—64 in E 2511753; cf. Eu.1-20.100 in E 252/1752). 
Fagnano found that, if M is any point on the arc OP of the 
lemniscate, the mapping z — z| maps the arc OM onto an 
arc PM' of equal length; the point Q such that OQ and QP 
have equal length can be obtained by putting either 
z = zj or z% = 1, and is given by z? = V2 — 1; if M is on 
the arc OQ, the mapping z — z. maps OM onto an arc of 
double length. 

It is of some interest to note that Fagnano obtained the 
duplication formula z —~ z, not of course by using the 
addition formula which he did not possess, but (Fag. 11.305— 
307) by combining the two mappings 


zV/2 uv/2 
===, Ll? 2 = 
Vi- 2 V1+ ut 
for which we have 
du dzv/2 dzy duv/2 


View Vi-# Vi-d Vi¢u 


ZU = 
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These formulas define “Landen transformations”, Le. 
transformations (or, in modern language, isogenies) of de- 
gree 2 between the curves (? = 1 — zt and w* = 1 + u’; 
replacing u by nu, where 7 is the eighth root of unity 7 = 
(1 NG one obtains the formulas for the complex mul- 
tiplication by 1 + 7 on the curve ¢? = 1 — z*. This aspect 
of Fagnano’s work, which was to acquire considerable im- 
portance at the hands of Abel, failed to attract Euler’s 
attention. 

Finally, consider a conic section C given by an equation 
y = ax’ + bwith b # 0,a # 0, —1; this is an ellipse for 
a <0, a hyperbola for a > 0. 

Its arc-length is given by 


with p = a/b, q = (a+a’)/b. If we define a curve T by 
v=(l + px?)(1 + qx’), 


then ds is a differential of the second kind on T, and we 
have a one-to-one correspondence between the points 
(x, y) of C for which x 2 0, y 2 0 and the points of I for 
which x = 0, ax” + b = 0,z 2 0. One can then proceed just 
as in the case of the lemniscate, except for the changes 
required, in view of Abel’s theorem, by the fact that the 
arc-length is now of the second kind. 

Take for instance the case b > 0,0 >a > — 1, corresponding 
to an ellipse; one is dealing here with the arc x 2 0, 
y = 0 of the ellipse, with origin P = (0, Vb) and end- 
point Q = (a, 0), where we have put a = V—(b/a); P 
and Q correspond respectively to the points (0, 1) and 
(a, 0) on T. Apply the subtraction formula (4) to the points 
(x, z) and (a, 0) on I (writing x, x’, a, 0, pg instead of x’, 
x, a, b, n, respectively); we get 


TP pe 
v=a/ px 
1 + qx 


which determines a mapping x — x’ of the arc PQ of the 
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ellipse onto itself, or rather onto QP since it maps P and Q 
onto one another; the relation between x and x’ can also be 
written as 


1 + p(x? tx’) + pgx’x” = 0, 


this being of course a special case of the relation denoted 
above by ® = 0. Differentiating this, we get 


(1 +qx'?)xdx + (1 +9x")x’dx’ = 0, 


which implies that x > x’ is, as expected, a one-to-one map- 
ping of PQ onto QP. This can be written: 


At the same time the arc-length on PQ can be written as 
ds = a(|dx|/x') and therefore its transform under the mapping 
x—> x’ is ds’ = a(|dx'|/x). Let M = (x, y) be any point on 
PQ; let M’ be its image under x — x’; then the lengths vA, 
\' of the arc PM and of its image QM’ are respectively given 
by 


in view of the relation obtained above, this gives 
A’ — dX = gaxx’, 


This, too, was Fagnano’s discovery (Fag.11.287—289; cf. Eu.1- 
20.82-90 in E 252|1752). Eventually it was extended by 
Euler to include the effect of the addition, subtraction and 
multiplication formulas on all elliptic integrals, not only 
those of the second kind (Eu.I-20.156—158 in E 261|1755, 
etc.), but even those of the third kind (Eu.I-21.39-56 = 
E 581{1775). All such results may of course be viewed as 
special cases of Abel’s theorem. 


Lagrange (by Delpech). Courtesy, Archives de l'Académie des Sciences, 


Paris. 


Chapter Four 


An Age of Transition: 
Lagrange and Legendre 


§I. 


In 1745 Euler had been hailed by his old teacher Johann 
Bernoulli as “mathematicorum princeps”, the first of mathe- 
maticians (cf. Chap.III, $111). By 1775 he clearly felt ready 
to pass the title on to Lagrange. “It is most flattering to me”, 
he wrote to his younger colleague, “to have as my successor 
in Berlin the most outstanding geometer of this century” 
(loc.cit. Chap.III, §1X). Such was indeed by then the universal 
verdict of the scientific world. “Le célébre Lagrange, le premier 
des géométres” is how LAVOISIER referred to him in 1793 
in an official request to the Convention in behalf of his 
friend (Lag.XIV.314—315) at the onset of the Terror which 
was soon to claim Lavoisier himself as a victim. In the next 
century the title of “princeps mathematicorum” was bestowed 
upon Gauss by the unanimous consent of his countrymen. 
It has not been in use since. 

LAGRANGE was born in 1736 in Turin, where his birth 
was registered as that of Giuseppe Lodovico Lagrangia. On 
his father’s side at least the family was of French origin, 
and Lagrange never used the Italianized form of his name; 
he always signed his letters as Louis (or Lodovico, or Luigi) 
de la Grange (or La Grange) and later as Joseph Louis 
Lagrange or simply Lagrange. Under the House of Savoy, 
Piedmont was largely bilingual, and Lagrange was equally 
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at home in French and in Italian: as he once told d’Alembert 
(Lag. X111.88|1767) his favorite poet was Ariosto. 

His education was the traditional one, based on the classics; 
mathematically he must have been very much on his own. 
Italy’s glorious mathematical tradition was by then a mere 
memory. Even Fagnano, surviving himself in the small pro- 
vincial town of Senigallia, could offer Lagrange nothing 
but formal words of encouragement when the latter, at the 
age of eighteen, sought to initiate with him a correspondence 
from which he obviously expected much more (Fag.I11.179- 
209|1754). In that same year 1754 Lagrange took it upon 
himself to address the great Euler, but got no answer. 

The next year everything started changing. In September 
1755 (cf. Eu.1V A-5.378 = Lag.X1V.147) he was appointed 
to a teaching position for mathematics and mechanics at 
the Artillery School in Turin, where he seems to have had 
some talented pupils, mostly older than himself. At the same 
time his study of Euler’s writings was bearing fruit. On 12 
August 1755 he wrote to Euler as follows: 

“Meditanti mihi assidue ... praeclarissimum librum tuum de 
methodo maximorum et minimorum ad lineas curvas applicata, 
factum tandem est, ut... inciderim in viam longe breviorem 
problemata hutuscemodi resolvendi ... Quanquam enim merito 
haesitandum fuerat, an mika, qui obscurt adhuc nominis sum, te 
tantum virum, omni pene scientiarum genere clarissimum, inter- 
pellare liceret; maximus tamen, ac plane singularis affectus meus 
in te ex operum tuorum studio 1am pridem conceptus, effecit, ut 
opportunam hance illius tibt quomodocumque testandi occasionem 
... de manibus dimittere nullo modo potuerim” [“Meditating 
assiduously upon your most splendid book on the method 
of maxima and minima as it applies to curves, I have at last 
been led to a much shorter way of solving problems of that 
kind ... Well could I hesitate whether such an obscure 
person as I am still ought to approach a man of your em- 
inence, of such high repute in almost every branch of science; 
nevertheless the great and indeed singular affection which 
I have conceived for you since long ago through the study 
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of your works has made it impossible for me not to seize 
this occasion of somehow giving vent to those feelings”: 
Eu.lV A-5.366 = Lag.XIV.138-139]. With this letter went 
a brief sketch which in effect created the classical calculus 
of variations. 

Euler’s response must have exceeded by far Lagrange’s 
expectations. Not only was it prompt and generous; it was 
enthusiastic. “You seem to have brought the theory of max- 
ima and minima to almost its highest degree of perfection; 
my admiration for your penetration knows no bounds”, 
Euler wrote to his young correspondent [“. . . theortam max- 
wmorum ac minimorum ad summum fere perfectionis fastigrum 
erexisse videris; eximiam ingenu tui sagacitatem satis admirari non 
possum”: EulV A-5.375 = Lag. XIV.144|6 Sept.1755]. The 
next year, on his insistent recommendation, Lagrange was 
made a corresponding member of the Berlin Academy, and 
the plan was formed of finding a suitable position for him 
there. The Seven Years’ War, then Maupertuis’ death in 
1759, brought these attempts to nought. 

There followed for Lagrange ten years of “assiduous 
meditation”, with the sole interruption of a visit to Paris. 
Invited by a distinguished Neapolitan diplomat, the Mar- 
chese Caraccioli, to accompany him to Paris and London, 
he left Turin in November 1763, but, having fallen seriously 
ill in Paris, he could not go on to England. He went back 
to Turin in May 1764, stopping in Geneva and meeting 
Voltaire in Ferney on the way home. In Paris he met all 
the French mathematicians, CLAIRAUT, LALANDE, 
d’ALEMBERT, CONDORCET (cf. Lag.XIV.14). Clairaut 
described him to Daniel Bernoulli as “un jeune homme trés 
singulier tant par ses talens que par sa modestie; son caractére est 
doux et mélancolique; il ne connoit d’autre plaisir que l'étude” [“a 
young man, no less remarkable for his talents than for his 
modesty; his temperament is mild and melancholic; he knows 
no other pleasure than study”: Eu.IV A-5.330, note [2]|27 
Dec. 1763]. With d’Alembert he had already started a cor- 
respondence (Lag. XIII.3—4|1759), on rather formal terms 
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at first; in Paris their acquaintance soon matured into an 
intimate friendship which lasted unbroken until d’Alembert’s 
death in 1783. 

By the time Euler left Berlin in 1766 for Petersburg, it 
had become obvious that no one but Lagrange was worthy 
of filling his place; Euler and d’Alembert concurred on this 
point, and Frederic II was quickly convinced. Lagrange 
accepted all the more readily, as nothing except empty 
promises had occurred in Turin since 1755 to improve his 
very modest position there (cf. Lag.XI11.65)1766). He set 
off for Berlin in August 1766, visiting his friends d’Alembert 
in Paris and Marchese Caraccioli in London on his way, 
and landing in Hamburg in October. The next year, unhappy 
about his solitary life as a bachelor in unfamiliar surround- 
ings, he sent for a kinswoman of his in Turin and married 
her in September 1767; to d’Alembert he described this as 
a pure matter of convenience, praising his wife as “an ex- 
cellent housewife and without any pretensions whatsoever” 
(Lag.X111.143]1769). Thereafter his life seems to have been 
quiet and undisturbed, except for occasional spells of poor 
health, until his wife fell gravely ill during the latter part 
of his stay in Berlin. She died in 1783, and d’Alembert, 
from his deathbed, could still dictate a touching last word 
of sympathy to his friend (Lag.XIJ1.377). Euler’s death oc- 
curred in that same year. 

For some time Lagrange had been almost without intel- 
lectual companionship; LAMBERT, whom Euler had re- 
cruited for the Berlin Academy shortly before leaving it, 
and who had become Lagrange’s good friend despite his 
uncouth manners and appearance, had died prematurely 
in 1777 (cf. Lag.XI1I.333-334). In 1786, the king’s death 
deprived the Academy of its founder and protector. In May 
of the next year Lagrange left Berlin forever to establish 
himself in Paris. By that time, as he soon confided to his 
friends and fellow-academicians there, he had lost his taste 
for mathematical research (cf. his Eloge by Delambre, 
Lag.I,p.XXXVII). He had barely passed the age of fifty. 

The remainder of his life was on the whole a happy one. 
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His nearest colleagues became his close friends; in 1792 he 
married the young daughter of one of them, the astronomer 
LE MONNIER; her loving care and devotion was to sustain 
him until his last moments. He watched, at first with sym- 
pathetic interest, then with increasing horror, the progress 
of the revolution, losing to the so-called Terror two of his 
best friends (Lavoisier and Condorcet). After the worst was 
over his merits were again recognized. His teaching at the 
Ecole Normale and at the Ecole Polytechnique was the oc- 
casion for him to work out and write up some of his old 
ideas, going back to his début in 1754, on the theory of 
functions (this making up the main content of his Théorie 
des Fonctions Analytiques of 1797 and of his Legons sur le calcul 
des fonctions of 1806: Lag.IX,X), and to compile his Traité 
de la résolution des équations numériques of 1798 (Lag. VIII). 
He did keep up his interest in the latest mathematical dis- 
coveries; in 1804 he was expressing to GAUSS his admiration 
for the Disquisitiones Arithmeticae of 1801: 

“Vos Disquisitiones vous ont mis tout de suite au rang des premiers 
géometres... J'ai depuis longtemps abandonné ces sortes de re- 
cherches, mais elles ont conservé beaucoup dattrait pour mot, et je 
me contente maintenant de jouir sur cette matiére, comme sur plusieurs 
autres, du fruit des veilles d’autrui ...” [“Your Disquisitiones 
have at once put you on a par with the geometers of the 
top rank ... For a long time I have given up this kind of 
investigations, but they keep much attraction for me, and 
it is now enough for me, on these matters as on several 
others, to enjoy the fruits of the labors of others”: 
Lag.X1V.299; cf. Lag.XIV.300|1808]. Perhaps he felt flat- 
tered by the compliment which the young Gauss, staking 
out his claim as Lagrange’s rightful successor, had paid him 
by adopting for his masterpiece, in latinized form, the same 
title that Lagrange had modestly given to his great Recherches 
a’Arithmétique of 1775. 

Late in life, laden with honors by Napoleon, he undertook 
a revision of his Méchanique Analytique of 1788 (Lag.XI-— 
XII); this task, which he was unable to bring to completion, 
seems to have exhausted him. Mortally ill, on 8 April 1813, 
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he held a long conversation wtih his friends Monge, Lacépéde 
and Chaptal — a truly Socratic dialogue, according to De- 
lambre’s account (Lag.I,p.XLV), where he reminisced about 
the past and welcomed his approaching end. Soon after 
this he lost consciousness; two days later he was dead. 


SII. 


Modest and diffident to a fault, Lagrange offers in many 
ways a complete contrast to Euler, whose ebullient enthu- 
siasm for his own discoveries, as well as for those of his 
contemporaries, never knew any bounds. As he once wrote 
to LAPLACE with obvious sincerity, he “took far more 
pleasure in the work of others than in his own, being always 
dissatisfied with the latter” [“ye jouis beaucoup plus des travaux 
des autres que des miens, dont je suis toujours mécontent”: 
Lag.X1V.71|1777]. Similarly, about a memoir of his on ce- 
lestial mechanics, he wrote to Condorcet: “Please feel entirely 
certain that I attach no value to my feeble efforts and that 
no one can have less pretensions than I have in anything” 
[je vous prie d’étre intimement convaincu que je n’attache aucun 
prix a mes faibles productions, et qu’on ne saurait avoir moins de 
prétentions que je n’en ai en quoi que ce soit”: Lag.XIV.15|1773]. 

Much and perhaps the greater part of Lagrange’s best 
work is directly inspired by that of Euler, which he had 
indeed “meditated assiduously” (loc.cit. §1) for many years, 
and which, already as a young man, he “knew by heart, 
down to the minutest details”, as Lalande had written to 
Euler in 1764 (Eu.IV A-5.330, note [2]). This applies par- 
ticularly to Lagrange’s work on number theory (cf. above, 
Chap.II], §VJ), a subject which started attracting his active 
interest only in 1768 (cf. his letter to d’Alembert, 
Lag.XII1.118—119|1768) and held it for less than ten years. 
His writings on arithmetical topics consist of the following: 


(A) An early paper on Pell’s equation (Lag.I.671—731), 
composed and sent to Turin in the summer of 1768; it 
contained the first proof ever written up for publication of 
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the fact (possibly already proved by Fermat; cf. Chap.II, 
§XIII) that Pell’s equation has always a solution, and there- 
fore infinitely many. Lagrange was later to criticize it as 
“very long and very indirect” (Lag.VII.159 = Fu.1-1.632, 
Art.84); to his annoyance it appeared only in 1773; by that 
time it had been superseded by his later work. 

(B) Three papers, presented to the Berlin Academy in 
1768, 1769 and 1770 (Lag.H1.377—-535,581-652,655—726), 
dealing chiefly with continued fractions and their application 
to the solution of diophantine equations. This included a 
definitive treatment of Pell’s equation (Lag.11.494—496), es- 
sentially equivalent to the one described above in Chap.II, 
§XIII; a proof for the fact that all quadratic irrationals have 
periodic continued fractions (Lag.11.603-615; cf. above, 
Chap.III, §XII); the highly original method of solution of 
equations z> = Ax* + By? (Lag.11.383—399) which has been 
summarized in Chap.I], §XIV; and methods of solution 
for all equations of degree 2 in two unknowns, to be solved 
in integers (Lag.11.655—726). Also included was a method 
alleged to provide a solution for all equations F(x, y) = a 
in integers when F is a homogeneous polynomial of arbitrary 
degree n (Lag.I1.662—696); as Legendre pointed out in his 
Essai sur la Théorie des Nombres of 1798 (I* Partie, §XV, 
Rem. III, no.126), this procedure is not effective for n > 2. 

(C) The three papers listed under (B) had been written 
before Lagrange received Euler’s Algebra (cf. Chap.III, §Vc). 
As soon as this was in his hands, some time during the year 
1770, he formed the plan of getting it translated into French 
by his junior colleague in Berlin, Johann III Bernoulli (a 
grandson of Euler’s teacher Johann I) and accompanying 
this translation with an improved exposition of the main 
contents of the three memoirs in question. This was promptly 
carried out, and a manuscript sent to the printer in Lyon 
in the next year (cf. Lag.XIV.4(1771); it came out in two 
volumes in June 1773. Lagrange’s contribution, making up 
pages 369-658 of volume I] (= Lag. VIE.5—180 = Eu.I- 
1.499-651), bore the title Additions, De ?’ Analyse indéterminée. 

(D) Lagrange’s proof for Fermat’s theorem on sums of 
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four squares (Lag.III.189-—201; cf. Chap.III, §XI); it ap- 
peared in the Berlin Memoirs in 1772. 

(E) A proof of Wilson’s theorem (Lag.II].425-—438; cf. 
Chap.II, §VII); this was read to the Academy in May 1771, 
published in the Berlin Memoirs of 1771, and was commented 
upon by Euler in a letter to Lagrange (Eu.IV A-5.496 = 
Lag.X1V.235-240|1773; cf. Chap.III, §VI). 

(F) Lagrange’s great work on binary quadratic forms, the 
Recherches d’Arithmétique, published in the Berlin Memozrs of 
1775 (Lag.111.697—758; cf. Chap.III, §1X), with a second 
part or rather a supplement published there in 1777 
(Lag. 111.759—795). 

(G) Lastly, a treatment of the diophantine equation y* = 
x* — 2x, or equivalently of X* — 2¥* = +2’, originating 
from the work of Fermat (cf. Chap.II, §XV and App.V) 
and lately considered by Euler (cf. Chap.JII, 8X VI). This 
is no more, and no less, than a carefully done exercise on 
Fermat’s method of infinite descent, but applied for the 
first time to an equation of genus | and of rank >0 (ie. 
with infinitely many solutions). This was read to the Academy 
in 1777 and appeared there in the same year (Lag.IV.377— 
398). 


SII. 


As appears from this list, the greater part of Lagrange’s 
contributions to arithmetic dealt with questions already 
considered by Fermat and Euler; these matters have been 
discussed in some detail in Chapters II and III and need 
not be taken up again here. We also refrain from describing 
Lagrange’s contribution to the theory of continued fractions 
as such, and his application of that theory to the numerical 
solution of algebraic equations and to indeterminte equations 
of degree >2, as these belong to so-called “diophantine 
approximation” rather than to number theory proper. On 
the other hand, Lagrange’s work on binary quadratic forms 
deserves a more thorough description than could be given 
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in Chap.III, 8IX, where it was viewed only through the 
failing eyes of the aging Euler. 

In 1768, announcing to d’Alembert his first big memoir 
on indeterminate equations (the first one of those listed 
under (B) in §II), he had written that he had thoroughly 
exhausted the subject of equations of degree 2 in two un- 
knowns, to be solved either in fractions or in integers {“Je 
crois avoir entiérement épuisé cette matiére, sur laquelle M. Euler 
parait s’étre vainement exercé”: Lag.X111.125|6 Dec.1768, and 
again a few weeks later: “Je crows n’avoir presque rien laissé a 
désirer sur ce sujet”: Lag.XI11.128|28 Feb.1769]. Of course 
he was right in pointing out that Euler, in dealing with such 
equations, had always been compelled to assume that at 
least one solution was known to him beforehand (Lag.11.378— 
379|1768 and Lag.X1II.128|1769; cf. Chap.III, §XIII), 
whereas his own methods were fully effective without any 
such limitation. Nevertheless he was soon to discover that 
he had not yet said the last word on this subject; already in 
1770 he was publishing a variant of his own solution under 
the title “Nouvelle méthode pour résoudre les problémes indéterminés 
en nombres entiers” (Lag.1J.655—726). The next year, working 
out still another variant for insertion into his Additions 
to Euler’s Algebra, and inspired (as he says) by the method 
of descent which he had used to solve the equation 2? = 
Ax® + By? in rational numbers, he introduced a new idea 
of such far-reaching importance that it has dominated the 
whole subject of quadratic forms ever since; in modern 
language it consists in an equivalence concept with respect 
to which forms of given discriminant fall into classes, coupled 
with a process of reduction which shows the number of 
classes to be finite (Lag. VII.125—-126 = Eu.I-1.603, Art.70). 

In the Additions this idea had been presented only inci- 
dentally, with no stress laid upon it. A few years later, having 
better realized its scope, Lagrange devoted to it the whole 
of his Recherches of 1775. As we have seen, Euler greeted 
this with enthusiasm and lost no time in summarizing it in 
his own way (cf. Chap.III, §1X). 
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SIV. 


Understandably, what impressed Euler most in the Re- 
cherches was that it contained proofs for some of the facts 
he had discovered empirically in his younger days (cf. 
Chap.ITI, §1X); in this he had the excuse that this was the 
aspect chiefly emphasized by Lagrange himself in discussing 
the consequences of his main idea. To do full justice to 
Lagrange, one must view his memoir in the light of the 
later developments which it inspired, and more specifically 
of the work of Gauss and his successors. 

Fermat, the creator of the theory of binary quadratic 
forms, had never ventured beyond a few specific cases of 
the type X* + AY’. Euler had systematically considered forms 
of the type mX* + n¥Y* but no others, except that he could 
not avoid the form X* + XY + Y? in dealing with X* + 3Y° 
(cf. Chap.IT, §VIII). Lagrange’s initial step, and in itself a 
momentous one, was to realize that a coherent general theory 
required the simultaneous consideration of all forms 
aX? + bXY + cY®. 

No less obvious perhaps to our eyes, but no less essential, 
was the recognition of the fact that two such forms f, F 
“represent” the same integers (i.e., that the equations 
f(x, y) = m, F(x, y) = m have solutions in integers for the 
same values of m) whenever there is a relation 


(1) F(X, Y) = flaX + BY, yX + 8Y) 


where a, B, y, 5 are integers satisfying ad — By = +1. In 
modern terms, this is accounted for by the fact that the 
substitutions 


(2) (X, Y) > (aX + BY, yX + BY) 


with a, 8, y, 6 as above, are invertible, the inverse of (2) 
being the substitution 


(X, Y) > (4=(6X- BY), +(-yX+eayY)). 


For forms f, F related as above, Lagrange, disinclined as 
always from bringing in new concepts and new terminology, 
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has the rather inadequate phrase “forms which can be 
transformed into one another”; Gauss introduced for them 
the term “eguivalent forms” which has been in use ever since. 
Lagrange shows that such forms have the same discriminant 
b? — 4ac; he is also well aware of the fact that the relation 
between them is truly “an equivalence relation” in the modern 
sense, i.e. that it is symmetric and transitive; in modern 
language this amounts to saying that the substitutions (2) 
make up a group, viz., the group GL(2, Z) of automorphisms 
of the additive group of vectors (x, y) with integral x and y. 
For brevity’s sake, if S is the substitution (2), we shall write 
det S for its determinant a6 — By, F = f°S for the relation 
(1), and & for the group GL(2, Z). The distinction, made 
later by Gauss, and so essential to his purposes, between 
“proper” and “improper” equivalence is not made by La- 
grange, nor yet, some time later, by Legendre; it depends 
upon the fact that the substitutions S in Y for which 
det S = | make up a subgroup SL(2, Z) of & of index 2, 
two equivalent forms f, F being called “properly equivalent” 
if there is S in that subgroup for which F = f° S, and 
“improperly equivalent” otherwise. 

Once this equivalence concept has been clarified, forms 
with a given discriminant can be distributed into classes, two 
forms being put into the same “Lagrangian” (resp. “Gauss- 
ian”) class if and only if they are equivalent (resp. properly 
equivalent); here again the first use of the word “class” in 
this sense belongs to Gauss. A “Lagrangian” class is either 
a “Gaussian” class, or it consists of two such classes. Here 
we Shall use the word “class” exclusively in the Lagrangian 
sense. 

Still another distinction has to be pointed out between 
the usage of Lagrange and that of Gauss. For Gauss the 
binary quadratic form (a, }, c) is defined as the form aX? + 
2bXY + cY?, with an even middle coefficient 26; Gauss always 
insisted that this was the most appropriate notation, clas- 
sifying forms according to their “determinant” b* — ac. On 
the other hand, Lagrange took as his starting point the 
forms aX? + bXY + cY®, where 6 may be even or odd, 
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while sometimes giving precedence to those with even 
middle coefficient. Here we will write (a, b, c) for the “La- 
grangian” form aX* + bXY + cY’, calling b° — 4ac its dis- 
criminant (which has the same parity as b). Such forms, 
after disappearing from the literature (under the influence 
of Gauss) during much of the nineteenth century, became 
prominent again after Dedekind re-interpreted binary 
quadratic forms in terms of the ideal-theory of quadratic 
fields. Of course the distinction between “Lagrangian” and 
“Gaussian” forms is merely one of notation as long as fields 
(or rings) of characteristic 2 are not considered, since a 
form f = aX? + bXY + cY° with b odd can always be replaced 
by the form 2fif one wishes to consider it within the frame- 
work of the Gaussian theory. For brevity we will exclude 
the forms (a, 6, c) whose discriminant 6? —4ac is 0 or a 
square; these are the forms which can be written as products 
of two linear forms with integral coefficients. 

Another observation of some moment, particularly in view 
of the problems previously studied by Euler, concerns the 
integers “properly represented” by a form f, i.e. those which 
can be written as f(x, y) with x prime to y; these are the same 
for two equivalent forms, since a substitution in GL(2, Z) 
maps every pair (x, y) with x prime to y onto a similar pair. 
For f = (a, b, c), both a and ¢ are properly represented by 
f, since we have a = f(1, 0), ¢ = f(0, 1); therefore, if F = 
(A, B, C) is equivalent to f, both A and C are properly rep- 
resented by f. Conversely, let m be an integer properly 
represented by f, so that we can write m = f(a, y) with a 
prime to y; then there are integers 8, 5 such that 
ad — By = 1, and m is the coefficient of X* in the form F 
defined by (1). Now let d be a “divisor” of the form /f, by 
which we mean, with Euler, any divisor of an integer m 
properly represented by f, provided it is prime to the dis- 
criminant D of f. Then there is F, equivalent to /, whose 
first coefficient is m, so that we can write F = (m,n, p); as 
F has the same discriminant as f, we have 


D =n? — 4mp = n® — 4d(m/d)p; 
therefore d is the first coefficient of the form (d, n, (m/d)p) 
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with the discriminant D, and is properly represented by it 
(cf. Chap. III, §1X). 

As Euler rightly observed (cf. Chap.III, §1X), Lagrange’s 
most decisive step consists then in showing that every form 
(a, b, c) can be transformed, by successive substitutions 


(X, Y) > (X-rY,Y) 
and 
(X, Y) rd (X, Y—sX) 


into an equivalent form (A, B, C) for which |B| is <|A| and 
<|C| (Lag. VI1.125-126 = Eu.I-1.603—-605, Art.70; Lag IH. 
697—700). His proof is the one described above in Chap.III, 
SIX. 

Lagrange had no word to designate the forms (A, B, C) 
satisfying the above conditions; we shall call them “reduced 
in the Lagrangian sense”, or simply reduced since no confusion 
is to be feared in the present context (Gauss, who introduced 
the word, used it in a somewhat different sense). For such 
a form, as we have seen (Chap.III, §1X), A, B and C are 
bounded in terms of D = B® — 4AC, so that it is easy to list 
all such forms with a given discriminant; Lagrange’s proof 
shows that there is at least one reduced form in each class. 
The process, described in Chap.II], §1X, by which a given 
form is transformed into a reduced one may be called “La- 
grangian reduction”. 

Thus far Euler had been able to follow Lagrange. Perhaps 
he attached less value to the last problem solved in the Re- 
cherches, or else he was deterred at this point by the com- 
plicated calculations which this involved. In the eyes of La- 
grange and of his successors, however, this was the 
indispensable complement of the process of Lagrangian 
reduction, the question being to obtain a set of representatives 
(in the strict sense) for the classes of forms with a given 
discriminant, i.e. a set of forms containing one and only 
one form from each class. Lagrange solved it by inventing 
a procedure for obtaining all the reduced forms equivalent 
to a given one without having recourse to trial and error 
(“sans aucun tatonnement”: Lag.111.737). Clearly, by applying 
such a procedure successively to each form in a list of all 
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the reduced forms with a given discriminant, and eliminating 
the superfluous ones accordingly, one will obtain a set of 
representatives as desired. 

At first Lagrange seems to have sought a unified treatment 
for all classes of forms, using for this a kind of canonical 
decomposition for substitutions in GL(2, Z); he soon dis- 
covered, however, that in this respect forms behave quite 
differently according to the sign of their discriminant. As 
he knew, a form f has a discriminant D < 0 if and only if 
all the values f(x, y) it takes, for integers x, y not both 0, have 
the same sign; since Gauss such forms are called definite, 
while forms with D > 0 are called indefinite. Clearly, among 
definite forms, it is enough to consider those with only pos- 
itive values; for such a form (a, 5, c), a and ¢c must be >0. 
Lagrange found that, if F = (A, B, C) is definite and reduced 
(in the “Lagrangian sense”), then there are no reduced forms 
equivalent to F except its transforms (A, +B, C), (C, +B, A) 
by the substitutions (X, Y) > (X, +Y), (X, Y) > (Y, +X). 
His proof for this was later superseded by one of Legendre’s 
(in his Essai of 1798, I® Partie, $VI1D) which is both simpler 
and more informative; it will be described in Appendix II. 

For indefinite forms the question is more difficult, and 
in a sense more interesting; Lagrange’s method for dealing 
with it is manifestly inspired by his own solution of Pell’s 
equation, which indeed may be regarded as the special case 
in which the given reduced form is X* — NY*. In his Essai 
of 1798 (I° Partie, §XII1I), Legendre gave an improved pre- 
sentation of Lagrange’s solution by making more explicit 
use of the theory of continued fractions; this was then 
adapted by Gauss to his own concept of “proper equivalence” 
(Disq. Art. 184-193; cf. Dirichlet’s definitive presentation in 
the Vorlesungen iiber Zahlentheorie, ed. R. Dedekind, Braun- 
schweig 1894, §$72-82). Lagrange’s original treatment of 
the problem will be described in Appendix III. 


§V. 


Adrien-Marie LE GENDRE (or LEGENDRE) was born 
in Paris in 1752; he died there in 1833. “The illness that 
put an end to his life was a long and painful one”, said 
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POISSON at his funeral; “it was our colleague’s often- 
repeated wish that one should mention only his works when 
speaking of him, and indeed they are all his life”. The year 
before he died, however, writing to the young JACOBI to 
congratulate him on his marriage, he indulged briefly in 
some reflections about his own personal life: “I married at 
a much later age than you, after a bloody revolution which 
had destroyed what little fortune I possessed; we had serious 
embarrassments and some really difficult moments, but my 
wife helped me mightily to put my affairs progressively in 
order .. . so as to secure ... a small fortune, the remains 
of which, after great losses incurred in subsequent revo- 
lutions, will still be adequate for taking care of my needs 
in my old age, and of those of my beloved wife when I 
shall be no more. But I have been talking about myself too 
much ...” [“Je me suis marié beaucoup plus tard que vous et a 
la suite d'une révolution sanglante qui avait détruit ma petite fortune; 
nous avons eu de grands embarras et des moments bien difficiles 
& passer mais ma femme m’a aidé puissamment a restaurer pro- 
gressivement mes affaires ... de maniére ad me procurer brentét 
une existence honorable et une petite fortune dont les débris, aprés 
de nouvelles révolutions qui m’ont causé de grandes pertes, suffiront 
encore pour pourvoir aux besoins de ma vieillesse et suffiront pour 
pourvoir @ ceux de ma femme bien-aimée quand je n’y serat plus. 
Mais c'est trop parler de mot. . .”: Jac.1.460|30 June 1832]. In 
1792, indeed, he had married a girl, not quite half his age 
at. the time, who was to survive him for 23 years. 

He had early attracted the attention of the senior math- 
ematicians in Paris. Born into a well-to-do family, he never- 
theless accepted a teaching position at the Ecole Militaire 
and held it from 1775 to 1780. In 1782 a prize-winning 
essay on ballistics, which he had sent to Berlin, caused La- 
grange (Lag.X1V.116|1782) to inquire about him from La- 
place, who answered with high praise (Lag.XIV.121{1783); 
in the same year Legendre was appointed to succeed Laplace 
on the lutter’s promotion to associate membership, as “ad- 
junct” (“adjoint mécanicien”) with the Paris Academy; he was 
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himself promoted in 1785. The Academy was abolished in 
1793, and eventually restored as part of the Institut de 
France; as he was to write to Jacobi, he experienced some 
difficulties in supporting himself during the troubled times 
of the French revolution, occupying various positions which 
he seems to have owed largely to his acknowledged skill in 
numerical computation. Already in 1787 he had been a 
member of an Academy committee on geodesy, whose busi- 
ness brought him to London where he was made a Fellow 
of the Royal Society; in 1791, and again from 1794 onwards, 
he was a member of committees which eventually established 
the metric system. From 1799 to 1815 he was examiner for 
the Ecole Polytechnique—an important position in those 
days, but hardly a prominent one. In 1813 he succeeded 
Lagrange on the Bureau des Longitudes, and held that 
position until his death. 

One utterance of Poisson’s, in his speech delivered at 
Legendre’s funeral, might not have won his late colleague’s 
approval. “Les questions relatives aux propriétés des nombres”, 
he said, “isolées de toute application, n’ont qu’un seul attrait, a 
la vérité bien puissant sur les mathématiciens: V’extréme difficulté 
qu'elles présentent” (“Questions pertaining to the properties 
of numbers, remote from any application, have only one 
attraction, but a potent one, it must be acknowledged, for 
mathematicians: they are exceedingly difficult”]. This can 
be contrasted with Legendre’s own statement, in the preface 
of his Essai of 1798 on number theory: “Z/ est @ croire ... 
qu’Euler avoit un gotit particulier pour ce genre de recherches, et 
quil s’y livroit avec une sorte de passion, comme il arrive a presque 
tous ceux qui s’en occupent” (“It appears ... that Euler had 
a special inclination towards such investigations and that 
he took them up with a kind of passionate addiction, as 
happens to nearly all those who concern themselves with 
them”: Essai sur la Théorie des Nombres, Paris An VI (= 1798), 
p.vi]. Surely, when writing these lines, Legendre must have 
been giving vent to his own feelings about what remained 
always, along with elliptic functions, his favorite subject. 
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§VI. 


Legendre’s first venture into number theory was a lengthy 
essay submitted in 1785 to the Paris Academy and published 
in 1788 under the title “Recherches d’Analyse Indéterminée” in 
the Mémoires de Mathématique et de Physique . . . de VAcadémie 
Royale des Sciences, Année 1785, pp.465—559: “an excellent 
memoir”, as the young Gauss wrote to his teacher Zim- 
mermann in May 1796 after discovering it in the Géttingen 
University library [“eine vortreffliche Abhandlung”: cf. 
L. Schlesinger, Ueber Gauss’ Arbeiten zur Funktionentheore, 
p.21, in Gau.X-2). By the time Legendre composed this 
essay, Euler was dead and Lagrange had ceased to be 
active in that field. Obviously Legendre had been an eager 
student of their work, including at least the first volume 
of Euler’s Opuscula Analytica published in 1783; this con- 
tained on page 84 (= Fu.J-3.512 in E 552; cf. Chap.III, 
$VIII and Appendix 1) a statement so close to the law of 
quadratic reciprocity as to be almost undistinguishable 
from it. 

As years went by, Legendre launched a more ambitious 
project; this took the form of a full-sized volume, where 
he sought to give a comprehensive account of number 
theory as he saw it at the time, including, beside his own 
research, all the main discoveries of Euler and Lagrange, 
as well as numerical evidence (in the form of extensive 
tables) for many results whose proofs he felt to be shaky. 
It was published in Paris in 1798 under the title Essaz sur 
la théorte des nombres: “I am not offering it as a complete 
treatise”, the author said in his preface, “but merely as 
an essay purporting to show roughly the present state of 
the theory” [“Je le donne non comme un traité complet, mais 
sumplement comme un essai qui fera connoitre a-peu-pres Vétat 
actuel de la science”: pp.ix—x]. 

Following the appearance of Gauss’s Disquisitiones in 1801, 
the work underwent an extensive revision, which however 
did not affect its main features; this led to a second edition 
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in 1808. Supplements, first printed in the Memoirs of the 
Paris Academy, were added to this subsequently. Finally 
a definitive edition appeared in 1830 in two large volumes; 
feeling that he had now done his utmost for the improve- 
ment of his treatise [“L’owvrage ayant ainsi regu tous les 
perfectionnements que l'auteur a pu lui procurer”: p.v], Legendre 
let it appear with the title Théorie des Nombres. By then, as 
his younger contemporaries well knew, Gauss’s Disquisitiones 
had made it almost wholly obsolete. 

It is said that Gauss’s principle, in publishing his work, 
was to remove the scaffolding before letting the public 
inspect the building. Of Legendre’s number theory it may 
fairly be said that there is often more scaffolding in it 
than solid masonry; even the foundations are at times so 
shaky that the visitor feels seldom quite safe, in spite of 
the warning signs which Legendre does insert from time 
to time. In discussing its contents it will be well to begin 
with the Recherches of 1785, where matters are more clearcut. 
This memoir is divided into four parts, the first two of 
which deserve only the briefest mention. Part I, on con- 
gruences of higher degree modulo a prime (pp.465—490) 
brings only a number of easy consequences of Euler’s 
results described above (Chap.III, §VI). Part II (pp.490- 
507) gives procedures for splitting a polynomial with integral 
coefficients into factors of lesser degree, over Q or over 
quadratic extensions of Q. 

Part III of the Recherches (pp.507—513) is devoted firstly 
(pp.507—509) to Lagrange’s solution of the equation 
ax” + by? + cz? = 0, described above (Chap.II, §X1V); this 
is supplemented (pp.509—5 13) by the proof of the following 
important criterion, which is one of Legendre’s main claims 
to fame: 


Theorem. Let a, b, c be three integers, not all of the same sign 
and such that abc 1s square-free. Then the equation 
ax’ + by? + cz? = 0 


has a solution in integers x, y, z, not all 0, if and only if — bc, 
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—ca, —ab are quadratic residues modulo |a\, modulo |b| and modulo 
|cl, respectively. 


While Legendre’s proof for this is straightforward, it is 
somewhat artificial; for another proof, cf. Appendix I 
(cf. also the fuller treatment given to this equation in 
Dirichlet’s Vorlesungen iiber Zahlentheorie, ed. R. Dedekind, 
Braunschweig 1894, $157). 

In Part IV (pp.513-—552) the above criterion is applied 
(pp.513—523) to an attempted proof for the theorem which 
Legendre was later to call “a law of reciprocity between 
primes” and which has remained famous as “the law of 
(quadratic) reciprocity”. ‘This establishes the relation between 
the quadratic residue character of an odd prime modulo 
another one g, and the character of g modulo p. In order 
to express this, Legendre introduced in 1785 the notation 
N‘%?- D2 where p is any prime and N an integer prime to p, 
explaining that this was not to be understood literally but 
meant the smallest residue (in absolute value) of N?~)/? 
modulo p, which is + | or — 1 according as N is or is not a 
quadratic residue modulo p. In 1798 he replaced this cum- 
bersome and inadequate notation by the celebrated “Le- 


defined as having that same meaning’ (cf. Chap.HI, §VI). 

The reasons for the partial success of Legendre’s idea 
will be explained in Appendix I. What Legendre does is to 
distinguish eight cases according to the values of p and of 
qg modulo 4 and to the value of (p/q); this same arrangement 
was observed by Gauss in his first proof of the law in question 
(Disq.Art.136). In each of these cases, Legendre introduces 
an appropriate equation of the form 


ax® + by? +7 =0 


with a = 6 = c = 1 (mod 4); this can have no non-trivial 


' For reasons of typographical convenience, this will be printed here 
as (N/p), except in displayed formulas. 
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solution, since the congruence 
ax” + by? + cz? = 0 (mod 4) 


has none. Therefore, by Legendre’s criterion, —bc, —ca 
and ~abcannot simultaneously be quadratic residues mod- 
ulo |a|, modulo |6| and modulo |c|, respectively. In each case 
Legendre tries to choose a, b, c so that this will lead to the 
desired conclusion. 

Take firstly p = 1, ¢ = — 1 (mod 4), (p/g) = — 1, and the 
equation x* + py? ~ gz” = 0; as (—1/q) = —1, we have 
(—p/q) = 1; therefore (q/p) cannot be 1; it must be — 1], as 
required by the reciprocity law. The same reasoning applies 
to the case gq = q’ = —1 (mod 4), (g/q’) = 1, and to the 
equation x* — gy — q’z = 0. 

Now take g = q' = — 1 (mod 4), (¢/q') = —1. In that case 
Legendre introduces the equation px” — gy? ~— q’2 = 0, 
where p is an auxiliary prime subject to the conditions 


eam. (F) = (5) 
= 1 (mod 4),(~]} = -1,{-] = -1. 
p= i (mo (4 q 


Assuming that there is such a prime, and using what has 
been proved before, the conclusion (q'/q) = 1 follows. For 
this to be valid, there has to be a prime in one of the arithmetic 
progressions 

{4qq'x+mlx = 0, 1, 2, ...} 


where one takes for m all the integers >0 and <4qq' which 
are =] (mod 4) and non-residues modulo g and modulo gq’. 

Legendre was rightly convinced that every arithmetic 
progression {ax + 5}, with @ prime to 6, contains infinitely 
many primes. “Perhaps it would be necessary to prove this 
rigorously”, he had written in 1785 (Recherches, p.552). “We 
need have no doubt about it”, he wrote in 1798 (Essaz, p.220), 
this time attempting to support his statement by an altogether 
unconvincing argument (Essai, pp.12—16) which he later 
expanded, even more disastrously, into a whole chapter of 
his Théorie des Nombres (t.11, §X1, pp.86~—104). It was left for 
DIRICHLET to prove the theorem of the arithmetic pro- 
gression, as he did in 1837 by a wholly original method 
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(Dir.J.315-342) which remains as one of his major achieve- 
ments; by the same method he also proved (Dir.1.499—502) 
that every quadratic form ax* + bxy + co represents infinitely 
many primes provided a, b, c have no common divisor, as 
had been announced by Legendre (Théorie des Nombres, t.11, 
pp.102—103). 

In the case we have just discussed, the theorem of the 
arithmetic progression, taken by Legendre as a kind of axiom, 
gave at least some semblance of justification to his argument; 
but there is worse to come. Take the case of two primes p, 
p', both =1 (mod 4). Legendre seeks to treat it by means 
of the equation px” + p’y> — gz” = 0, where g is an auxiliary 
prime satisfying 


q = —1 (mod 4), (4) = 1. (4)- -1, 


or alternatively by means of x° + py’ — p'qz = 0, with q 
satisfying 


g= -14mod 4), (£) = —] 


(Recherches, pp.519—520; Essai, pp.216-217 and 220-221; 
Théorie des Nombres, t.1, pp.233—234). Is there such a primer 
Clearly its existence would follow from Dirichlet’s theorem 
and the reciprocity law; but it is doubtful (as Gauss pointed 
out: Disq.Art.297) that it could be proved otherwise. Thus 
Legendre’s method of proof ends up unavoidably with a 
piece of circular reasoning out of which there seems to be 
no escape. This did not prevent him, in a letter of 1827 to 
Jacobi, from complaining, as bitterly as unjustly, that Gauss 
had “claimed for himself” the discovery of the reciprocity 
law (Jac.1.398). He must have realized, however, that Gauss’s 
criticism of his proof had its validity; in his Théorie des Nombres 
of 1830 he chose to be on the safe side by inserting (t.II, 
pp.57—64) Gauss’s third proof (cf. Gau.11.3-8/1808), and, 
for good measure (t.II, pp.391—393) the “cyclotomic” proof 
communicated to him by Jacobi at the beginning of their 
correspondence (Jac.1.394|1827). 
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The remainder of Part IV of the Recherches of 1785, or 
at least its most valuable portion (pp.531—548) is devoted 
to a series of observations (“quelques remarques assez singuliéres”, 
as he calls them, p.531; “Vébauche d’une théorie entiérement 
nouvelle”, he was to call this in his Essat of 1798, p.ix) con- 
cerning the representations, not only of integers, but also 
of binary quadratic forms as sums of three squares. These 
were based on the numerical evidence contained in the tables 
at the end of the memoir (pp.553—559); “on peut les regarder 
comme autant de théorémes”, he wrote, “car quoique je n’en donne 
pas la démonstration complete (sic!], us sont fondés au moins sur 
une induction trés-étendue” [“one may regard them as so many 
theorems, for, while 1 am not giving the full proof for them, 
they are at least based on wide numerical evidence”: p.53 1]. 
Even Euler might not have been quite so casual about such 
matters. 

This sketch was later expanded into the whole third section 
of the Essai of 1798 (pp.321-—400) and again rewritten for 
the 1808 edition; by that time the same ground had been 
covered, far more concisely, more solidly and more fully, 
in Gauss’s Disquisitiones (Disq.Art.266—293). Nevertheless, 
was it a mere coincidence that the young Gauss initiated 
his study of sums of three squares in July 1796 (cf. nos. 17 
and 18 of his Tagebuch, Gau.X-1.496—497) a few weeks after 
discovering Legendre’s Recherches in the Gottingen library? 
At any rate Gauss’s first step (zbed., no.17) was to verify the 
relationship, noted by Legendre in 1785, between the rep- 
resentations of an integer N as a sum of three squares and 
certain binary quadratic forms of determinant —N, also 
decomposable into sums of three squares. From this one 
can derive relations between the number of representations 
of N by sums of three squares and suitable class-numbers 
for forms of determinant —N; having discovered such re- 
lations empirically in 1785, Legendre sought to establish 
them in the Essai of 1798 (pp.366—400) and again in his 
later publications; this included in particular Fermat’s cel- 
ebrated assertion that every integer is a sum of three “tri- 
angular numbers” (Essaz, p.399; cf. above, Chap.II, XIV). 
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As Gauss rightly observed, these attempts involved Legendre 
in serious complications from which it might not have been 
easy for him to extricate himself, and which left his main 
conclusions still open to doubt [“compluribus difficultatibus 
implicatus est, quae effecerunt ut theoremata palmaria demon- 
stratione rigorosa munive non licuerit”: Disg., Addit ad Art.288— 
293, Gau.1.466]. Gauss, of course, gave complete proofs for 
such results, and for many more of similar import, in the 
Disquasitiones. 

On another matter of great importance Legendre also 
appears as a forerunner of Gauss, and more so perhaps 
than Gauss was willing to concede. Lagrange had shown, 
as we have seen (cf. above, §IV), that, if d is any “divisor” 
of a quadratic form, then it can be properly represented 
by some form with the same discriminant. In particular, let 
dbea “divisor” of X? + NY?;i.e. such that — Nisa quadratic 
residue modulo d (cf. Chap. III, VIII); then d can be prop- 
erly represented by some form f = aX”? + 2bXY + cY? of 
determinant 5? — ac = —N. Let d’ be another such divisor, 
similarly represented by a form /’; assume that d’ is prime 
to d. As —N is a quadratic residue modulo d and modulo 
d’, itis such also modulo da’; therefore dd' can be represented 
by some form F, also with the determinant —N. It was Le- 
gendre’s great merit to perceive that F depends only upon 
f and f’; in other words, there is a form F such that every 
product of two mutually prime integers, properly repre- 
sented by f and by /’ respectively, is properly represented 
by F. 


In order to do this, Legendre takes two forms 


f(X, ¥) = aX? + 26XY + c¥’, 
fF (8,¥') = aK? + QDR’ + c¥”? 


with the same determinant 6 = 6” — ac = b’? — a'c’. Putting 
Z = aX + bY, Z' = a'X' + b'Y', he gets: 


af(X,¥) = Z? -— 8Y*, a’ f'(X', ¥') = 2” — bY”. 


To this he applied “Brahmagupta’s identity” (cf. above, 
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Chap. III, §VIID); this gives: 
AX YSP(XY') = 4 [(ZZ' + 8YY’)® — &(ZY'+¥Z')’]. 


The right-hand side can be written as AU” + 2BUV + CV? 
by putting A = aa’, C = (B? — 8)/A, and further 


V=Z2Y' + YZ’, AU + BV= ZZ' = 8YY', 


where B has still to be determined suitably. An easy cal- 
culation gives U = XX’ + mYX' + m'XY' + nYY', with m, 
m', n given by 
m= 2B m' = ; n=mm' *C 
a a 
(Essai, [V* Partie, HII, nos.362—369). 

Assume now that a and a’ are mutually prime; if not, as 
Legendre observes, one can replace f’ by an equivalent form 
so as to fulfil that condition, provided of course a’, 26’ and 
c’ have no common divisor. Take then B = +0 (mod a), 
B =)’ (mod a’); then we have 


B? = 6? = § (mod a), B® = b’* = 8 (mod a’), 


so that all the coefficients in the above formulas are integers. 
In view of the double sign, Legendre had thus obtained 
two forms 


F(U, V) = AU? + 2BUV + CV? 


with the determinant B? — AC = 6 and the property that 
there is an identity 


A(X, YF'(X'. Y’) = FIB(X, Y; X', Y'), BUX, Y; X’, Y)] 


where B, B’ are two bilinear forms in X, Y and X’, Y’. Brah- 
magupta’s identity appears here as a special case; at least 
another one had been noticed by Euler (cf. his Algebra, Eu.I- 
1.424, Art.178, and above, Chap.III, §XIV). 

Anyone familiar with the Gaussian concept of composition 
for binary quadratic forms will immediately recognize it in 
the above elementary construction; the form F correspond- 
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ing to the upper signs in those formulas (which clearly de- 
pends symmetrically upon f and f’) is the one derived from 
f and f’ by “composition”. No doubt the Gaussian theory, 
as Gauss chose to describe it (Disg.Art.234—260), is far 
more elaborate; so much so, indeed, that it remained a 
stumbling-block for all readers of the Disqutsitiones until 
Dirichlet restored its simplicity by going back very nearly 
to Legendre’s original construction (Dir.11.107—114]1851). 
Was Gauss inspired by Legendre’s Essai to develop his own 
theory? This seems at least plausible, notwithstanding Gauss’s 
statement that he did not see Legendre’s book until “the 
greater part” (“maxima pars”: Gau.1.7) of the Disquasitiones 
was in print; actually he started work on “composition” 
in the Fall of 1798 (cf. Gau.1.476, Zu Art.234) about the 
time when he visited PFAFF in Helmstedt (cf. his letter to 
BOLYAI of 29 Nov. 1798); perhaps he could have seen 
the Essai there’. 

Legendre seems to take it for granted that the classes of 
the two forms F which (because of the double sign) he derives 
from f and f’ depend only upon the classes of f and /”; this 
is not quite obvious, and would deserve proof. But there is 
at the same time an essential difference between his treatment 
and that of Gauss. Legendre never made the distinction 
between proper and improper equivalence; his classes of 
quadratic forms are the Lagrangian classes (cf. above, §IV). 
Thus it was left to Gauss to discover that the “Gaussian” 
classes, under the operation of composition, make up a 
finite commutative group, and to draw the consequences 
of this fact. If C is a “Gaussian” class, then the corresponding 
“Lagrangian” class is either C if C = C7’ in the sense of 
the group operation, or else it is the union of C and Col; 
if C’ is another “Gaussian” class, then composition in the 


? The relevant Section V of the Disquisitiones was not in print until 
much later; it contains the theory of ternary quadratic forms, which 
Gauss attacked only in February 1799 (cf. no.96 of the Tagebuch, Gau.X- 
1.539) and did not bring to completion before February 1800 (cf. 
ibid..no.103, Gau.X-1.545). 
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sense of Gauss produces a class CC’, while Legendre’s op- 
eration produces the Lagrangian classes corresponding both 
to CC’ and to CC’! and is thus a two-valued operation, 
which accounts for the double sign in his formulas; so it 
appears also in the tables (pp.432-434 of the Essaz) where 
he lists the result of his operation, e.g. for the determinant 
— 89. In this matter, too, Legendre had missed perhaps the 
most valuable feature of his discovery. 

One last satisfaction was granted Legendre in his old age. 
Since the work of Euler, Fermat’s so-called “last theorem”, 
for exponents greater than 4, had remained as a challenge 
to all arithmeticians. In 1798 and again in 1808 Legendre 
had been unable to do more than reproduce in the Essaz 
Fermat’s proof for exponent 4 and Euler’s for exponent 3 
(cf. above, Chap.II, §§X and XII, and Chap.III, §XIV). 
Interest in the problem was revived in Paris in the following 
decades, particularly after the Paris Academy, in 1816, made 
it the subject of their annual prize-competition for 1818; 
Olbers brought this to Gauss’s attention, and Gauss, with 
characteristic caution, answered that the problem had little 
interest for him as such, but that, with luck, its solution 
might perhaps turn up as a by-product of a wide extension 
of the higher arithmetic (“einer grossen Erweiterung der héheren 
Arithmetik: Gau.X-1.75|1816) which he was contemplating. 

In the meanwhile Sophie GERMAIN, whose talent had 
early attracted the notice of Lagrange, Legendre and Gauss, 
had started working on Fermat’s theorem, obtaining some 
valuable results based on ingenious congruence arguments 
(cf. e.g. H. M. Edwards, Fermat’s last theorem, Springer 1977, 
p.64, or P. Ribenboim, Thirteen lectures on Fermat's last theorem, 
Springer 1979, pp.53-—55). These need not be discussed 
here; what concerns us 1s rather the application of the infinite 
descent by Dirichlet and by Legendre, in the year 1825, to 
Fermat’s equation for exponent 5. 

Infinite descent 4 la Fermat depends ordinarily upon no 
more than the following simple observation: if the product 
a of two ordinary integers (resp. two integers in an algebraic 
number-field) is equal to an m-th power, and if the g.c.d. 


336 Number Theory Chap.IV, §VI 


of a and @ can take its values only in a given finite set of 
integers (resp. of ideals), then both a and B are m-th powers, 
up to factors which can take their values only in some as- 
signable finite set. For ordinary integers this is obvious; it 
is so for algebraic number-fields provided one takes for 
granted the finiteness of the number of ideal-classes and 
Dirichlet’s theorem about units. In the case of a quadratic 
number-field Q(VN), this can be replaced by equivalent 
statements about binary quadratic forms of discriminant N. 
A typical case is given by Euler’s proofs for Fermat’s theorem 
(cf. Chap.III, §XIV); there the initial step consists in writing 
the equation in the form 
(x—y)x—pya-j'y) = 2, 

where j = (—1+V—3)/2 is a cubic root of unity, and ap- 
plying the above principle to the factors in the left-hand 
side. As shown in Chap.II, Appendix I, and as Euler knew 
(loc.cit.), this can easily be replaced by an argument based 
on the theory of the form X* + 3Y?. 

In seeking to deal with Fermat’s equation x? — y? = 2’, 
where p is an odd prime >3, it is natural to split x? — y? 
into linear factors in the “cyclotomic field” Q(e), where e is 
a p-th root of unity; this was all the more tempting after 
1801, since Gauss had developed the algebraic side of the 
theory of Q(e) in section VII of the Dasquisitiones; it is clearly 
what Gauss had in mind in writing to Olbers in 1816 (loc.cit.). 
But ideal-theory had to be created first. 

Gauss himself, however, had shown how x? — y? can 
be split into factors in the quadratic number-field 
k= Q(V =p), where the sign is the one for which 
+p = | (mod 4); this field, as he had shown, is contained 
in Q(e), from which fact it follows that the polynomial 


7 
which (as Gauss had shown) is irreducible over Q, splits 


into two factors P + QV+p,P — QV+p, where P, Q are 
two polynomials of degree (p— 1)/2 inx, y, with half-integer 


= Poh t Py to + PT, 
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coefficients; this gives an identity 
F= P** pQ’, 


so that the principle described above can be applied to Fer- 
mat’s equation when it is written in the form 


(x—y(P+QV + pyP-QV +p) = 2. 


Moreover, instead of the theory of the quadratic field &, 
which was still to be worked out, one could, even in Le- 
gendre’s lifetime, apply the equivalent theory of the binary 
quadratic forms with the discriminant +, as it was known 
from the work of Lagrange and that of Gauss. 

This is precisely what was attempted in 1825 by a young 
student in Paris, Lejeune Dirichlet, then barely 20 years 
old; he had gone there to study mathematics, since this 
could hardly be done in Germany at the time. Naturally he 
tried his hand at the first outstanding case p = 5; he could 
do so all the more readily, since the required identity, viz.: 

xt + xPy + xPy? + xy? + y* = (x? +4bxy + y?)?— B(exy)? 
had been known since the days of Euler (Eu.J-3.280 in 
E 449|1772) who had derived from it some partial results 
concerning the form X* — 5Y? (cf. above, Chap.III, §1X). 

In this first attempt Dirichlet was only partly successful. 
If the equation is written symmetrically as x° + y®? + 2° = 0, 
where x, y, 2 have no common divisor, it is obvious that one 
of them must be even while the other two are odd. By con- 
sidering the congruence 


x + y? + 2 = 0 (mod 25) 


one sees also that one of the unknowns must be =0 
(mod 5) while the other two are not; the same conclusion 
can be derived, somewhat more elaborately, from Sophie 
Germain’s theorems mentioned above. At first Dirichlet’s 
treatment (by infinite descent) could be applied only to the 
case where one and the same of the three unknowns is a 
multiple of 2 and of 5. 

This is where Legendre, then well over 70 years old, 
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stepped in. After presenting Dirichlet’s paper to the Academy 
in July 1825 (cf. Dir.I.3—13), it took him only a few weeks 
to deal with the remaining case. The technique he used did 
not differ much from Dirichlet’s. 

This had perhaps been a modest Everest to climb, and 
Dirichlet had guided him almost to the top. But Legendre 
got there first. He described the whole proof in a paper 
presented to the Academy in September 1825 and printed 
in its Memoirs as a “Second Supplement” to his Essaz; in it 
he duly quoted “M' Sophie Germain” and also, strangely 
enough, one “Lejeune Dieterich”. The next year Dirichlet 
was back in Germany, preparing his proof for publication 
in Crelle’s newly founded journal where it appeared in 1828 
(Dir.1.21—46); in it he included the case he had had to omit 
previously. Legendre inserted his own version of the proof, 
with no reference to Dirichlet, in volume II of his Théorie 
des Nombres of 1830 (6° Partie, §IV, pp.361—368). Had he 
by that time persuaded himself that his alone was the glory? 
If so, the old man deserves forgiveness. Towards Jacobi 
and Abel he had behaved with great generosity, unstintingly 
lavishing praise on their discoveries in his other topic of 
predilection, the theory of elliptic functions. As to Dirichlet, 
he was soon to take his flight and soar to heights undreamt 
of by Legendre. 


Appendix I 


Hasse’s Principle for ‘Vernary Quadratic 
Forms 


By a form we shall understand here a homogeneous poly- 
nomial F of any degree 6 in some indeterminates X;, Xz, ..., 
X,, with coefficients in Z. Clearly the equation F = 0 hasa 
non-trivial solution in Q, i.e. a solution in rational numbers, 
not all 0, if and only if it has a “proper” solution in Z, i.e. 
a solution in integers without a common factor. On the 
other hand, we will say that a solution (x;, x9, ..., x,) of the 
congruence F = 0 (mod m) is proper if x), xo, ..., X, are integers 
whose g.c.d. is prime to m. Obviously, if m’ is prime to m, 
F = 0 has a proper solution modulo mm’ if and only if it 
has such solutions modulo m and modulo m’. 

Let G be another such form, with indeterminates Y,, Yo, 
..., Y,3 we shall say that the equations F(X) = 0, G(Y) = 0 
are equivalent if there is a linear substitution S with coefficients 
in Z, of determinant D # 0, transforming F into aG with 
a # 0; this means that we have 


FIS(Y)] = aG(Y). 


Then the substitution T = aD.S~' has also integral coeffi- 
cients, and we have 


G[T(X)] = a®'D®F(X), 


which shows that the relation between F and G is symmetric; 
as it is obviously reflexive and transitive, it isan equivalence 
relation in the usual sense. 


Lemma 1. Let the forms F, G be such that the equations F = 0, 
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G = 0 are equivalent; let p be a prime. Then the congruence 
G = 0 (mod p") has a proper solution for all w if 
and only if F = 0 has the same property. 


Notations being as above, let p* be the highest power of 
p dividing a; let x = (x, ..., x,) be a proper solution of F = 0 
(mod p“**); put y = T(x), and call p* the highest power of 
p dividing all the y;. Put y’ = py; as we have 


S(y') = p-S(y) = po*aDx, 
and as x is proper, p* must divide aD. Now we have 
aG(y") = F[S(y')] = (p-*aD)°F (x); 


as the right-hand side is =0 (mod p***), G(y’) must therefore 
be =0 (mod p*), so that y’ is a proper solution of that 
congruence. 

Using elementary properties of the ring of p-adic integers, 
it is easy to show that the assumption (resp. the conclusion) 
in lemma | amounts to saying that F = 0 (resp. G = 0) has 
a non-trivial solution in the p-adic field Q,. After this remark, 
the lemma becomes obvious. 

Here we shall be concerned exclusively with non-degen- 
erate quadratic forms; these are the forms of degree 2 

F(X) = >) ajXiX; 
ij=1 
for which the determinant det(a,) is not 0. The so-called 
Hasse-Minkowski principle says that for such a form the 
equation F = 0 has a non-trivial solution in Q (and therefore 
a proper solution in integers) if and only if it has one in 
the field R of real numbers and the congruence F = 0 
(mod m) has a proper solution for every integer m. The 
latter condition is tantamount to saying that the equation 
F = 0 has, for every prime p, a non-trivial solution in 
p-adic numbers’. The value of this principle for modern 


' For a proof, based on an elementary treatment of quadratic forms 
from the point of view of p-adic fields, cf. J.-P. Serre, Cours d’Arithmétique, 
P.U.F., Paris 1970, Chap.I-IV. 
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number theory is greatly enhanced by the fact that it can 
be extended to all algebraic number-fields”. As was discov- 
ered in substance by Legendre in 1785, a somewhat stronger 
result holds for ternary forms: 


Theorem 1. Let F(X, Y, Z) be a non-degenerate ternary quadratic 
form. Then the equation F = 0 has a non-trivial solution in Q if 
and only if it has one in R and the congruence F = 0 (mod p”) 
has a proper solution for every power p" of every odd prime p. 


Of course the condition is necessary; only the converse 
need be proved. By an elementary substitution, the equation 
F = 0 can be transformed into an equivalent one of the 
form 

F(X, Y, Z) = aX? + bY? + cZ’ = 0; 

here, as F is non-degenerate, a, b and c are not 0; as F = 0 
has solutions in R, a, b and ¢ are not all of the same sign. 
Assume that F satisfies the conditions in our theorem; in 
view of lemma 1, this implies that every equation G = 0 
equivalent to F =0 satisfies the same conditions. Now consider 
Lagrange’s method of solution for an equation F = 0, 
as it has been described in Chap.II, §X1V; in it one replaces 
F = 0, firstly by an equivalent equation Z* = AX? + BY? 
with square-free A and B, not both negative, then successively 
by similar equations Z* = MX* + NY®, each one equivalent 
to the next one, until one reaches an equation for which M 
is either 1 or does not satisfy any congruence M = m? 
(mod |N)). In the former case this last equation, and therefore 
also the original one, have non-trivial solutions. We shall 
prove our theorem by showing that the latter case cannot 
occur. 

In fact, if M does not satisfy any congruence M = m 
(mod |N]), then, since N is square-free, it must have a prime 
divisor p such that M does not satisfy any congruence M = 
m (mod p); clearly p must be odd. Consider the congruence 


Z? = MX? + NY? (mod p”); 


2 


* As to the history of Hasse’s discovery of his “principle”, cf. H. Hasse, 
Crelles J. 209 (1962), pp.3—4. 


342 Number Theory Chap.IV,App.1 


as we have seen, it follows from our assumptions about F 
that this has a proper solution (x, y, z). Here p cannot divide 
x; for, if it did, it would divide z; p? would divide Ny*; as N 
is square-free, p would divide y, and (x, y, z) would not be 
proper. Taking now x’ such that xx’ = | (mod p), we have 
M = (x'z)? (mod p), which contradicts the assumption 
about M. 

In order to derive from this Legendre’s original for- 
mulation of his theorem, we need some lemmas. 


Lemma 2. Let p be a prime. If a, b and ¢ are prime to p, the 
congruence aX* + bY? + cZ” =0 (mod p) has a proper solution. 


As we have seen, this is a theorem of Euler’s; for the 
proof, cf. above, Chap.III, §XI. 


Lemma 3. Let p be a prime; put F(X) = Xj.) aX 2 where all 
the a; are prime to p. Assume that the congruence F(X) = 0 (med p”) 
has a proper solution, with w = 1ifp # 2and w= 3 if p = 2. 
Then the congruence F(X) = 0 (mod p**') has a proper solution. 


Let (x), ..., X,) be a proper solution of F(X) = 0 (mod p*); 
then we can write F(x) = pr, and one of the x,, say x,, is 
prime to p. Put 

yy =x, + ptt resp. yy) = x, + 27" 


according as p # 2 or p = 2; put y; = x; for? = 2, ..., n. 
We have 
F(y) = p*(r+ 8a,x,t) (mod p**') 
with 6 = 2ifp # 2,6 = Lifp = 2. Taking ¢ such that 
r + 6a,x\t = 0 (mod p), 
we see that (y), ..., Yn) 18 a proper solution of F(X) = 0 
(mod p**'). 
Now take again an equation F = 0, with F of the form 
F(X, Y, Z) = aX? + bY? + 22, 


where a, b, c are not 0 and are not all of the same sign. If 
abc is not square-free, and e.g. a = a’'m® with m > 1, then, 
replacing Y, Z by mY, mZ, we can replace F = 0 by an equiv- 
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alent equation with the coefficients a’, b, c. If e.g. a and b 
have a common divisor d > 1, then, writing a = a’d, b = 
b'd and replacing Z by dZ, we can replace F = 0 by an 
equivalent equation with the coefficients a’, b’, cd. As labc| 
decreases in both procedures, we can, by repeating them 
if necessary, obtain an equation of the same type, equivalent 
to F = 0 and for which abc is square-free, or, what amounts 
to the same, for which a, b and c are square-free and pairwise 
mutually prime. For such equations, we have Legendre’s 
theorem (cf. above, §VI): 


Theorem 2. Let a, b, ¢ be integers, not all of the same sign, such 
that abe is square-free and not 0. Then the equation 


F(X, Y, Z) = aX? + bY? + cZ? = 0 


has a non-trivial solution in Q if and only if — bc, — ca and — ab 
are quadratic residues modulo |a|, modulo |b| and modulo |cl, 
respectively. 


If F = 0 has a non-trivial solution in Q, it has a proper 
solution (x, y, z) in Z. Here z must be prime to ab; for, if e.g. 
p is a prime dividing z and a, it must divide by’, hence y 
since } is prime to a; then p” divides ax* and p divides x since 
a is square-free, so that (x, y, z) is not proper. Take now 2z’ 
such that zz’ = 1 (mod |ad|); then we have — be = (byz’) 
(mod |a|) and —ca = (axz')’ (mod ||); the proof for — abd is 
of course similar. Therefore the conditions in the theorem 
are necessary. 

Assuming now that they are satisfied, we shall verify that 
those in theorem | are also satisfied. Let p by any odd prime. 
If it does not divide abc, then, by lemma 2, the congruence 
F = 0 (mod p) has a proper solution; in view of lemma 3, 
this implies, by induction on p, that it has such solutions 
modulo p* for all x. Now let p be a divisor of abc, e.g. of c 
and therefore not of ab. As — abis a quadratic residue modulo 
¢|, itis such modulo #, so that we can write —ab = m? (mod fp). 
Then (m, a) is a proper solution of the congruence 
aX® + bY*® = 0 (mod p); applying lemma 3 and induction 
on p, we see that, for every 4, the same congruence has a 
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proper solution (x, y) modulo p", so that (x, y, 0) is a proper 
solution of F = 0 (mod p*). This completes the proof. 


Corollary. Let N be a square-free integer, other than 0 or —1, 
let p be an odd prime, not a divisor of N. If pis a quadratic residue 
modulo 4|N|, it can be written as a® + Nb? with rational a 


and b, 


Apply Legendre’s theorem to the equation X* + NY? = 
pZ°. This will have a non-trivial solution in Q if p is a quadratic 
residue modulo |Nj and —N is a quadratic residue modulo 
p. In view of the results described in Chap.III, Appendix 
1, both conditions will be fulfilled if p is a quadratic residue 
modulo 4|N| (and also if p is a quadratic residue modulo 
IN| and N = — 1 (mod 4)). Then, if (x, y, z) is a non-trivial 
solution of X* + NY? = pZ?, z cannot be 0, so that p = 
a’ + Nb? with a = x/z, b = y/z. This had been predicted 
by Euler for the case N > 0, p = 1 (mod 4N) (cf. above, 
Chap.III, §1X, and Corr.1.605—606]1753). 

Finally, since Legendre applied this theorem to a proof 
(only partly successful) of the law of quadratic reciprocity, 
it seems worthwhile to comment briefly upon the relation 
between the two topics; for this purpose it is convenient to 
introduce a symbol [F'],, where F is a ternary form over Z 
and v is either a prime or the symbol ~. If p is a prime, we 
put [F], = + 1 if the equation F = 0 has a non-trivial solution 
in the field Q, of p-adic numbers, or, what amounts to the 
same, if the congruence F = 0 (mod p*) has a proper solution 
for all x; otherwise we put [F], = —1. We put [F]. = +1 
or — | according as F = 0 has non-trivial solutions in R or 
not. For F = Z? — aX? — bY?, [F], is nothing else than the 
so-called Hilbert symbol (a, 5),. As every equation F = 0 is 
equivalent to one of the latter type, it follows from lemmas 
1, 2 and 3 that, for any F, we have [F], = +1 for all except 
finitely many primes p. 

The Hasse-Minkowski principle in its most general form, 
applied to a ternary form F, says that F = 0 has a non- 
trivial solution in Q if and only if [F]. = +1 and [F], = 
+1 for all primes p. On the other hand, theorem | (or, 
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what amounts to the same, Legendre’s theorem) says that 
it is enough, for F = 0 to have a non-trivial solution in Q, 
that F should satisfy [FJ]. = +1 and [F], = +1 for all odd 
primes p; thus these conditions imply [F], = +1. 

This last result is actually a special case of Hilbert’s product- 
formula, valid for every ternary form F over Z: 


Fl. [LF], = 1 
p 
where the product is taken over all primes. This says in 
effect that the number of factors equal to — 1 in the left- 
hand side must be even. In particular, if all those factors 
except possibly [F]o are +1, then also [F], must be +1; 
this, combined with the Hasse-Minkowski principle, gives 
Legendre’s result. 

Hilbert’s formula contains the quadratic reciprocity law, 
from which in its turn it can be deduced by elementary 
considerations (cf.J.—P. Serre, loc.cit.). Apply it for instance 
to the form F = pX* + qY* — Z?, where p and q are odd 
primes and the sign is determined by + q = | (mod 4). Here 
we have [F].. = +1. It is easily seen that either (0, 1, 1) or 
(2, 1, 1) is a proper solution of F = 0 (mod 8); in view of 
lemma 3, this gives [F]e = +1. Lemmas 2 and 3 show that 
[F], = +1 for every prime r other than p and g. Finally, 
reasoning just as above in the proof of theorem 2, one gets 


rs (hin (2) 
[Fp (= [F'], ; 


so that these last symbols must be both +1 or both — 1. As 
we have observed in Chap.III, Appendix I, this is nothing 
else than the quadratic reciprocity law. The fact that Le- 
gendre’s theorem implies at any rate one special case of the 
product-formula accounts for his partial success in applying 
it to the proof of the law in question. 


Appendix II 


A Proof of Legendre’s on Positive Binary 
Quadratic Forms 


In his Additions to Euler’s Algebra, Lagrange had raised 
the question of finding the minimum of | f(x, y)| for a given 
binary form f with integral coefficients when x, y are integers, 
not both 0; he had solved it for quadratic forms, both definite 
and indefinite (Lag. VII.61—74, Art.31-36 = Eu.I-1.552- 
562), by applying to it the theory of continued fractions. 

In those same Additions, he had introduced the process 
of “Lagrangian reduction” (cf. above, §III), later devoting 
to it the whole of his Recherches of 1775. Actually there is a 
rather close connection between this topic and the minimum 
problem mentioned above, although there is nothing to 
show whether Lagrange was aware of this connection, which, 
in the case of definite forms, seems to have been first pointed 
out by Legendre in his Fssaz of 1798. It depends upon the 
following observations. 

For brevity’s sake, if f = (a, 6, c) is any binary quadratic 
form (in the notation explained above in §IV), we will write 
u(f) for the minimum of | f(x, y)| when x, y are given all 
integral values, not both 0. The forms for which p(f) = 0 
are those whose discriminant 6” — 4ac is 0 or a square; as 
we have excluded these once for all (cf. above, 81V), u(f) 
is an integer >0. 
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Lemma 1. Let f be a binary quadratic form (whose discriminant 
is neither 0 nor a square). Then there is a form F = (A, B, C) 
equivalent to f, such that A = +p(f) and |B| < |A| < |C|. 


Take (a, 6) such that | f(a, 6)| = wf). If we had a = da’, 
b = db’ with d > 1, we would have 


Ifla’, 6.) = |d-"fla, b)| < mf). 


Therefore a and 6 are mutually prime, so that f(a, 0) is 
properly represented by f. As shown above in §IV, there 
is then a form f’ = (m, n, p) equivalent to f with m = 
f(a, 6) = +w(f). As in Lagrangian reduction, take v such 
that —|m| <n — 2vm < |m]; then the substitution (X, Y) > 
(X—vY, Y) transforms f’ into a form F = (A, B, C) with 
A =m = +p/(f), |B] < |Al. As C is properly represented 
by F, it is properly represented by f, so that we have 
IC| = w(f), which proves the lemma. 

As Legendre discovered (cf. his Essai of 1798, I° Partie, 
§VIII), there is, for definite forms, a kind of converse to 
lemma 2. Stated in somewhat more precise form than in 
Legendre, it is as follows: 


Lemma 2. Let F = (A, B, C) be a binary quadratic form with 
A>0,0 =B<A CC. Then, if x, y are integers, we 
have F(x, y) > C except forx = 0,» = +1 or for A = B, 
x= —y = +1, in which cases we have F(x, y) = C, or possibly 
for y = 0, in which case we have F(x, 0) = Ax? (and F(x, 0) > A 
unless x = O or +1, F(x, 0) 2 A unless x = 0). 

Let D = B? — 4AC be the discriminant of F; it is <0. If 
F(x, y) = C, we have 

4AF(x, y) = (2Ax+By)? + |D|y? < 4AC 
and therefore y’ < 4AC|D|~'. As0 < B < A <C, we have 
\D| = 3AC; this gives y° < 4/3, y = Oor +1. Ify = +1, the 
above inequality can be rewritten as 
(2Ax+B)*? + |D| — 4AC = 4A(Ax? + Bx) < 0 


i.e. Ax? + Bx < 0; this cannot be so unless either x = 0 or 
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A = B,x = +1, in which cases we have F(x, y) = C. As to 
our statements for the case y = 0, they are obvious. 


Lemma 3. Let F = (A, B, C) be a form satisfying the conditions 
in lemma 2. Then w(F) = A; moreover, if C > A, we have 
F(a, b) = A if and only of (a, b) = (+1, 0); f C = A>B, 
we have F(a, 6) = A if and only if (a, b) = (£1, 0) or (0, + 1); 
if C = A = B, we have F(a, b) = A of and only if (a, 6) = 
(+1, 0), (0, 1) or (+1, ¥1). 


This follows at once from lemma 2. 


Theorem. Let F = (A, B,C), F' = (A', B’, C’) be two equivalent 
forms, both satisfying the conditions in lemma 2. Then F = F'. 


As F and F’ are equivalent, we can write 
F'(X, Y) = F(aX + BY, yX + 8Y), ad — By = +1; 


also, we have w(F) = w(F’) and therefore A = A’; this gives 
F(a,y) = A, and therefore, by lemma 3, y = 0,a@ = +1 
unless C = A. If C = A, there are, by lemma 3, four or six 
values of (a, 6) for which F(a, b) = A; therefore the same 
must be true of F’, so that, by the same lemma, we have 
C' = A’ = A;asF and F’ must have the same discriminant, 
and as B and B’' are 20, this gives B = B’, F = F’'. 
Now take the case C > A, y = 0; then we must have 
a= +1,6 = +1. Here we have 


B' = 2AaB + Bod = +2ABR + B, 


and hence 2AB = +B + B’. As B and B’' are 20 and SA, 
this implies B = B’, and therefore, as above, C = C’, 
Fe= F'. 

Now, for every form f, the process of Lagrangian reduction 
produces an equivalent form F = (A, B, C) with |B| < |A| 
and |B| < |C|; if fis positive-definite, both A and C must be 
>0, while if f is indefinite they must be of opposite signs 
since B° — 4AC is then >0 and |AC| is 2B”. As it is obvious 
that the forms (A, +B, C) and (C, +B, A) are all equivalent 
(cf. above, §IV), there must be, in each Lagrangian class of 
positive forms, at least one satisfying the conditions A > 0, 
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0=B<A<C of lemma 2; as Lagrange found, this is 
unique, and the above lemmas, in substance, make up Le- 
gendre’s proof (obtained, as he says, “par une méthode par- 
ticuliére”: Essai, p.xij) for this result. 

As to the determination of u(/f) for an indefinite form 
f, lemma | says that there is a form F = (A, B, C) equivalent 
to f and reduced “in the Lagrangian sense”, for which 
#(f) = |A|; thus the problem can be solved by constructing 
all such forms. Lagrange’s method for doing this will now 
be described in Appendix III. 


Appendix ITI 


A Proof of Lagrange’s on Indefinite Binary 
Quadratic Forms 


Here, for convenience, we shall use matrix notation in 
addition to the notations introduced above in §IV. Thus 


we write 
gs = (ek 
(*8) 


for the substitution given by (2), 81V. We write again fe S 
for the form F given by (1), 8IV, i.e. for the transform of the 
form f by the substitution S, and & for the group GL(2, Z) 
of the substitutions S with integral coefficients a, B, y, 5 
such that det S$ = +1. We write the group @ multiplicatively; 
thus, for S and T in &, ST will denote the substitution 


(x, y) > ST(x, y) = S[T(x, y)}s 


then, for a form f, we have fe ST = (f° S)° T. Of course 
those notations are foreign to Lagrange, having been de- 
veloped only in the nineteenth and twentieth century. 

Expressed in this language, Lagrange’s first observation 
is as follows: 


Lemma 1. Every substitution (: p in Gcan be changed, by 


multiplication to the right and to the left by substitutions of the 
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Multiplying it to the right by one of these substitutions, 
we can change it into one for which a > 0, B > 0 (if neither 
of them is 0) or else into one for which a and B are 20 and 
y, 6 are not of opposite signs (if a or B is 0). In the former 
case the relation ad — By = +1 implies that y, 6 are not 
of opposite signs. Multiplying then to the left with one of 


0+1 
We will write 9, for the set of those substitutions 


a B 


4 in Yfor which a, B, y, dare all 20; this isa semigroup, 


the substitutions I ‘) will do what was required. 


ie., the product of any substitutions in G, is itself in G,. 
As to these, Lagrange showed in substance that they can 
all be obtained by a succession of substitutions (x, y) > 
(y, x) and (x, y) > (x, y+ wx) with zp > 0. If we put, in matrix 


notation: 01 10 
re (to). t= (a). 


then these are the substitutions J, 7“; we have in fact 


»_ {10 we (Hl 2_ {10 
maui) um =(f} #= (at) 


With these notations, Lagrange’s result can be stated as 
follows: 


Theorem 1. Every substitution in G.. can be expressed as a product 
of substitutions J and T* with w > 0. 


Take S = (: 8) in G,; after replacing S by SJ if a < B, 


we may assume that a > B. In substance, Lagrange proceeds 
then by induction on £; following his procedure, we first 
consider the case B > |. As a = B, we can write a = wB + p 
with 0 < p < 8, just as in the euclidean process for finding 
the g.c.d. of a and £, or for constructing the continued 
fraction for a/B. Here, asad — By = +1, ais prime to B, 
so that p > 0. Then, putting 


= _ f= Bp 
y — 6, S (ge , 
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we have S = S'JT". AsS, J and T’ are in @, so is S’; therefore 
we have 


det S’ = Bo — pd = +1. 
This implies that o is =O, since otherwise we would have: 
Bo — pi = Bo = -B< —-1. 


Therefore S’ is in Y,.. We can then, as Lagrange does, operate 
on S’ in the same manner provided p > 1, or, what amounts 
to the same, apply the induction assumption to S’ provided 
the cases B = 1 and B = 0 have been dealt with first of all. 

In the case 8 = 1, we puta = wB + p with w = a and 
p = Oif a — By = —l, resp. uw =a — landp = lif 
ad — By = | (in which case a and 6 must be 21; Lagrange, 
otherwise so scrupulously careful in such matters, thought 
that this case cannot occur, even giving a fallacious argument 
to prove it: Lag. 111.733, lines 8—9).In either case, take again 
o = y—p0,1.e.0 = y — ad = Lresp.g = y — (2-1/8 = 
5—1. Then we get, as before, S = S'JT*, with 


ro 6 ro 1 t . 
S’ = T° resp. § -(} 1): 


repeating the same procedure on S’ in the latter case, we 
find S’ = T°~'JT, which completes the proof for B = 1. If 
B = 0, we must have a = 6 = land S= T”. 

For every 5S in G,, the above procedure gives an expression 


S = TTT ... [TR 


where all the yx; are 20, all are >0 except possibly the first 
and last one, and where the y,, for m > 0, determine a 
continued fraction for a/B: 


A = by + U/(ftmat + Af + 1M # 1/pn) ). 


It can be shown easily that the above expression for S is 
unique, but this point is not considered by Lagrange. 

We shall now describe Lagrange’s application of theorem 
I to the theory of indefinite binary quadratic forms; we 
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recall that we exclude the forms whose discriminant is 0 or 
a square. 

The process of Lagrangian reduction (Chap.IV, §IV; cf. 
Chap.III, §1X) applied to a given form leads to a form 
(A, B, C) reduced “in the Lagrangian sense”, i.e. for which 
\B| is <|A] and <|C|, so that B® < |AC|. Then B? — 4AC has 
the sign of —4AC, so that, if it is >0 as we are assuming 
here, A and C must be of opposite signs, i.e. AC < 0. It will 
be convenient to say that a form f = (a, 5, c) is weakly reduced 
if ac < 0; this is the same as to say that the polynomial 
f(X, 1) = aX? + bX + chas two real roots of opposite signs. 
Clearly there are only finitely many weakly reduced forms 
with a given discriminant. If f = (a, b, c) is weakly reduced, 
so is fe] = (c, 6, a). 

To bring clarity into Lagrange’s treatment of his problem, 
it is also necessary to give a name to those forms f for which 
f(X, 1) has a positive root >1 and a negative root >— 1; we 
shall call them strongly reduced. This concept is indeed implicit 
in Lagrange’s calculations, as well as in his solution of Pell’s 
equation and in his treatment of the continued fractions 
for quadratic irrationalities (cf. above, §I1(B)), two topics 
closely related to the present one. For the same reason it 
has already made its appearance (under the name of “re- 
duced forms”) in our discussion of Pell’s equation in Chap.I, 
§XIII. In the Disquisitiones Gauss introduced a closely related 
concept, also under the name of “reduced forms” (Disgq. 
Art.183); perhaps he had merely disengaged it from La- 
grange’s calculations and adapted it to his own purposes. 

Lagrange’s problem was to decide when two indefinite 
forms, reduced in his sense, are equivalent. His solution 
depends upon the facts expressed in the following lemmas. 
As usual, if € is any real number, we write [€] for the integer 
# determined by uw <€<y + 1, ie. for the largest integer 


<E. 


Lemma 2. Let f be a weakly reduced form, let & be the positive 
root of f(X, 1). Then, for > 0, the form f’ = f ° JT" ts weakly 
reduced if and only if w < &, strongly reduced if and only if 
mw = [E]; and the form f” = fe JT] ts never strongly reduced. 
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Put f(1, 0) = a, and call —7 the negative root of f(X, 1). 
Then: 


GY) = fuXtY, X) = al¥ +t w)X]- [¥ -(€— p)X]. 


This is weakly reduced if and only if € — x > 0. The roots 
of f’(X, 1) are 
é' _ 1 _mf — a1 
f- we Tne 
If w > 0, 7’ is >0 and <1; as to €’, it is >1 if and only if 
0<€-uw<l,te. w = [€]. Finally, the roots of f"(X, 1) 
are € — wand —y ~—yp, and the latter is <— 1. 

From this it follows that every weakly reduced form is 
equivalent to a strongly reduced one, viz., f° JT” for w = 
[é] if the positive root é of f(X, 1) is >1, and (feJ) ° JT* = 
foT* for w = [1/é] if € < 1 since in the latter case we can 
apply lemma 2 to f° J. As every form is equivalent to one 
reduced in the Lagrangian sense, and as every indefinite 
form, reduced in that sense, is weakly reduced, this shows 
that there is at least one strongly reduced form in every 
Lagrangian class of indefinite forms. In view of this and of 
lemma 2, Lagrange’s problem amounts to finding all the 
strongly reduced forms equivalent to a given one. The key 
to this is contained in the following lemma (cf. Lag. 1II.730— 
732): 


Lemma 3. Let S' be a substitution in G, ; put S = S']T" with 
some > 0. Let f be a weakly reduced form; let F' = f° S' and 
F = fe S be its transforms by S' and by S, respectively. Then, if 
F is weakly reduced, so is F'. 


Put f(1, 0) = a; let €, —7 be the positive and the negative 
root of f(X, 1), respectively; we have: 


[{X, VY) = A(X + nY)(X— e¥). 
As in the proof of theorem I, let S’ be given by S’ = 


Bp eo _ 
§ go]? 50 that we have S = ys witha = uwB+ p,y = 


pd + o. Put F = (A, B,C), F’ = (A’, B', C’). We have 
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A = fla, y) = a(at+ny(a-éy), 

C =A’ = f(B, 8) = a(B + 8)(B— £8), 

C' = f(p, 0) = a(p + no)(p— £0). 
Put: 


6=a — éy, C=Bp -€, wo =p ~ 0. 
To say that AC < 0 is the same as to say that @ and ¢ have 
opposite signs; we have to show that, when that is so, also 
¢ and w have opposite signs, or, what amounts to the same, 
that w has the sign of 9. This follows at once from the fact 
thatw = @ -— pl. 


Lemma 4. Let f be a weakly reduced form; let & be the positive 
root of f(X, 1); let w, v be integers, both >0; put f’ = fe JT", 
f" =f °JT”. Then, if f" is weakly reduced, f’ ts strongly reduced, 
Ea >1, and p = [€]. 


Applying lemma 3 to S’ = JT", S = S'JT” and to f, we 
see that f’ must be weakly reduced. As in the proof of lemma 
2, the positive root of f'(X, 1) is €' = 1/(€—). Now apply 
lemma 2 to f’ and f”; it shows that we have €’ > v; as v is 
21, this gives 0 < € — w <l,ie. w = [€]; by lemma 2, 
this implies that f’ is strongly reduced. 


Lemma 5. Let p11, be, +.) Um be mntegers, all >0; for 1 <i m, 
put 
S, = JT" JT"... JT". 


Let fo be a weakly reduced form; for | <i <m, put f, = fo Si, 
and assume that f,, is weakly reduced. Then f,, fo, .... fm—1 are 
strongly reduced; moreover, if €; is the positive root of f({X, 1), we 
have, for 1 <1< m, [&;_,] = pw, and &, is given by the continued 
fraction: 


fo = Mi + U/ (Me + U/ > + W/ (ui + 1/8)» ). 


From lemma 3 it follows that f,,- is weakly reduced, then 
successively, by induction on j, that f,,-; is weakly reduced 
for j = 2, 3, ...,m—1. The assertions in the lemma follow 
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then from lemma 4 applied to fj-1, fi, fis: for 
1<i<m-l. 

Let f be a weakly reduced form such that the positive root 
é of f(X, 1) is >1; then, for w = [€], we shall say that the 
strongly reduced form f’ = f° JT” is the Lagrangian transform, 
or more precisely the first Lagrangian transform of f; it is thus 
given by 


f'(X.Y) = f+, X). 


The same procedure can then be iterated, i.e. applied to 
f’, then to the transform f” of f’, etc., giving successive 
Lagrangian transforms f", f”, etc.; thus, in lemma 5, the 
forms/;, fo, ...,fm—1 are the first m— 1 successive Lagrangian 
transforms of fo. For instance, in Chap.II, §XIII, the forms 
(—1)'F;_,, fori = 1, 2, 3, etc., are the successive Lagrangian 
transforms of the weakly reduced form X? — NY”. 

From these facts Lagrange deduces the solution of his 
problem, formulating it in terms of forms “reduced in the 
Lagrangian sense”. As noted above, it can be formulated 
just as well (and more concisely and elegantly) in terms of 
“strongly reduced forms”; it is contained in the following 
theorem: 


Theorem 2. Let f = (a, b, c), F = (A, B, C) be two equivalent 
indefinite forms, both strongly reduced; put f° = (c, —8, a), 
F* = (C, ~B, A). Then either F or F” is one of the successive 
Lagrangian transforms of either f or f°. 

Let S be a substitution in Y, such that F = f° S. Take 
first the case when S is in 9,. Then, by theorem 1, we can 
write 


S = TT") ... JT 
where all the uz; are >0 except possibly the first and last one 
which are 20. If wo = 0, wu», > 0, lemma 5, together with 


lemma 2, shows that F is the m-th Lagrangian transform of 
f. If uw, were 0, we could write 


SJ = ST 
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with S' = J if m = 1, and anyway S’ in G,. This gives 
Fo] = fe ST; 

here, as fand F° J are weakly reduced, we can apply lemma 

3 tof, fe S' and Fe J; it shows that f° S’ must be weakly 


reduced. Then lemma 2, applied to f° S’, shows that the 
form 


Fa (foS')oJTy 
cannot be strongly reduced, which contradicts our as- 
sumption; thus p,, is >0. Now we show that fo must be 0. 
In fact, if #o > 0, we can write F = f’ ° So with f’ = fe J, 
So being given by 
So = JT] ... JT’; 

as f’ is weakly reduced, lemma 5 shows that f’ ° JT" is 
strongly reduced if m 2 1; the same is true by assumption 
if m = 0. Then lemma 2 shows that the positive root of 
f'(X, 1) must be >; but this cannot be, since that root is 
1/é if € is the positive root of f(X, 1), and we have é > 1 
since f is strongly reduced. This proves our theorem for S$ 
in G,. 


If S is not in G,, then, by lemma 1, it can be written as 
. . +1 0 
ES'E’ with S’ in G, and E, E' of the form ( 0 1). Call 


H the substitution (x, y) > (y, —x), 1e., in matrix form 


we have 


H®? = -1,,H] = —JH = (; 4) 


where 1, is the identity in G. Now each one of the substitutions 
E, E' can be expressed, either as + lo, or as +HJ = ¥ JH. 
Thus lemma 1| shows that every substitution S in Gcan be 
expressed in one of the forms +S’, +HS', +S’'H or +HS'H 
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with S' in G,. Consequently, if F = fe S, one of the forms 
F and F* = F oH is the transform, by a substitution S’ in 
@,, either of f or of f° = f° H. In view of what has been 
proved above, this completes the proof of our theorem. It 
should be noted that, if f = (a, b, c) is weakly (resp. strongly) 
reduced, then so is f° = f° H = (c, —6, a). 

To a result, equivalent in substance to theorem 2 (but 
expressed in terms of forms “reduced in the Lagrangian 
sense”), Lagrange adds one further remark. Since there are 
only finitely many reduced forms with a given discriminant, 
some form must occur twice in the sequence of the La- 
grangian transforms of any given form f, so that such a 
sequence must be periodic, at least from a certain point 
onwards. Thus, as Lagrange observes (Lag.I11.740—741), 
the sequence need not be continued beyond that point, so 
that the process is a finite one. 

Taking our cue, however, from the solution of Pell’s 
equation (Lag.11.494—496; cf. ibid. pp.429-443 and 603- 
615, and above, Chap.II, §XIII), we can proceed one step 
further. Let f, be a strongly reduced form; let f,, fo, etc., be 
the sequence of its Lagrangian transforms, with f; = 
fi-1°JT™ as before; call €; the positive root of f;(X, 1); by 
lemma 5, we have 


Mist = (6), G41 = ——,é, = pi + I/(ug + 1/ +); 
éi— Bist 
}41, Me, etc. are the successive integers occurring in the con- 
tinued fraction for &; as the sequence (fj) 1s periodic, at 
least from a certain point on, so is the sequence (y;). Now 
put f; = fie H. In view of the easily verified identity 


H(JT") 'H7! = —JT*, 
we have: 
fia = frost. 
As all the f; are strongly reduced, this shows that f es 


; . * 
fi etc., are the successive Lagrangian transforms of f;. 
Now, since the forms f; must repeat themselves, assume that 
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fisn = fi for some j and some n > 0. Then we have f;,, = 
fj and therefore fj,,-1 = fj-1 etc., and finally f, = fo, 
which implies f, = fo, fn+i = fis Mati = Mi for allz > 0. That 
being so, since we have f = f o, the successive Lagrangian 
transforms of fo are no other than fy_1, fa-2, -» fi and 
again fo. Consequently theorem 2 can be replaced by the 
following more precise statement: the strongly reduced 
forms equivalent to fy are fi, fo, 5 fr-1, and fo, fi, . 
f,-.if these are different from fo, fi, .. fo-1- 
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